Title: Expression of alpha-macroglobulins

FIELD OF THE INVENTION 

The present invention relates to the expression of α-macroglobulins,
derivatives and variants thereof, and especially the expression of the
human α2-macroglobulin (α2M) in an active form in mammalian cells, and the
expression of genetically engineered variants thereof. The use of such
recombinant α-macroglobulins, especially recombinant α2M (rα2M) and variants,
is described with examples from the fields of medicine for therapeutic
purposes, and the development of novel defined growth media for propagation
of mammalian cells in culture.

BACKGROUND OF THE INVENTION. 

BIOCHEMISTRY OF α2-MACROGLOBULIN (α2M).

The proteinase-binding glycoprotein α2M, which is synthesized in
the liver, constitutes, together with the complement proteins C3, C4 and C5, a
separate class of structurally and functionally related large plasma
proteins. For a recent review see Sottrup-Jensen, L. (1987) in: The Plasma
Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando,
FL.

Apart from C5 these proteins contain an internal β-cysteinyl-γ-glutamyl
thiol ester, which enables the proteolytically activated forms
of α2M, C3, and C4 to participate in characteristic covalent binding reactions
(Sottrup-Jensen, L., et al., (1980) FEBS Lett. 121: 275-280; Salvesen, G.S.
and Barrett, A.J., (1981) Biochem. J. 187: 695-701). The thiol ester
structure, which in the active proteins can be slowly cleaved by a number of
small nitrogen nucleophiles, constitutes a unique type of postsynthetic
modification of proteins, and plays a prominent role in the biological
properties of α2M. The presence of the active thiol esters in α2M is revealed
by a characteristic pattern of heat fragmentation (Harpel, P.C., et al.,
(1979) J. Biol. Chem. 254: 8869-8878).

Traditionally, α2M has been studied within the context of plasma
proteinase inhibitors, although by several criteria it is unique. Whereas
most plasma proteinase inhibitors are monomeric proteins of roughly similar
size, containing approximately 430-500 residues, α2M is a tetramer whose
180-kD subunits contain 1451 residues (Sottrup-Jensen et al., (1984) J. Biol.
Chem. 259: 8318-8327).

Furthermore, in contrast to most other proteinase inhibitors,
which form 1:1 complexes with serine proteinases engaging the active site
of the proteinase and the reactive site of the inhibitor, α2M forms complexes
with a broad spectrum of proteinases differing in their substrate specificity
and catalytic mechanism, e.g. trypsin, leucocyte elastase, chymotrypsin,
pancreatic elastase, cathepsin G, plasmin, plasma kallikrein and thrombin.

The second-order rate constant for association between these
proteinases and α2M varies by several orders of magnitude. Both 1:1 and 2:1
proteinase-α2M complexes can be formed, and the disulfide-bridged dimer (360
kD) appears to be the functional unit of α2M (Sottrup-Jensen, L. (1987) in:
The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press,
Orlando, FL). Contrary to "classical" proteinase inhibitor complexes the
α2M-bound proteinase is still active, especially toward small synthetic
substrates (Sottrup-Jensen, L. (1987) in: "The Plasma Proteins" (Putnam,
F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL).
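The subunit arithmetic behind the figures quoted above can be sketched in a few lines of Python. The 180-kD subunit mass, the 1451 residues per subunit and the dimer/tetramer organisation are taken from the text; the function name and the per-residue average are illustrative assumptions only (the average includes carbohydrate, since α2M is a glycoprotein).

```python
# Figures quoted in the text above.
SUBUNIT_KD = 180            # mass of one alpha2M subunit, in kD
RESIDUES_PER_SUBUNIT = 1451  # residues per subunit


def assembly_mass_kd(n_subunits, subunit_kd=SUBUNIT_KD):
    """Nominal mass of an assembly of identical subunits (hypothetical helper)."""
    return n_subunits * subunit_kd


dimer_kd = assembly_mass_kd(2)     # disulfide-bridged functional unit
tetramer_kd = assembly_mass_kd(4)  # native tetramer

# Rough average mass per residue in daltons (glycan included).
avg_da_per_residue = SUBUNIT_KD * 1000 / RESIDUES_PER_SUBUNIT

print(dimer_kd, tetramer_kd, round(avg_da_per_residue))
```

The dimer value reproduces the 360 kD stated for the functional unit, and the tetramer comes out at a nominal 720 kD.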

The mechanism of proteinase binding by α2M has been described by
the "trap" hypothesis (Barrett, A.J. and Starkey, P.M. (1973) Biochem. J. 133:
709-724), where proteolytic cleavage of a particularly exposed peptide stretch
near the middle of the 180-kD subunit (the "bait" region) results in a
conformational change of the α2M tetramer, thereby entrapping the proteinase.
The nature of the essentially irreversible proteinase complex formation
with α2M has long remained elusive. However, recent investigations show that
a major fraction (typically > 80-90 %) of the trapped proteinase is also
covalently bound through ε-lysyl (proteinase)-γ-glutamyl (α2M) bonds
(Sottrup-Jensen, L. et al., (1981) FEBS Lett. 128: 127-132; Sand, O. et al.,
(1985) J. Biol. Chem. 260: 15723-15735; Pochon, F. et al., (1987) FEBS Lett.
217: 101-105).

PHYSIOLOGICAL ASPECTS OF PROTEINASE-α2M INTERACTIONS.

Since the α2M-proteinase complexes are rapidly cleared from the
circulation (Ohlsson, K. (1971) Acta Physiol. Scand. 81: 269-272; Imber,
M.J. and Pizzo, S.V. (1981) J. Biol. Chem. 256: 8134-8139), a general role
as a "clearing vehicle" for plasma proteinases has been envisaged.

The main physiological targets may include proteinases of the
coagulation and fibrinolysis systems and plasma kallikrein, and perhaps also
proteinases like leucocyte elastase, cathepsin G and collagenases and other
proteinases released during cellular turnover (Sottrup-Jensen, L. and
Birkedal-Hansen, H. (1989) J. Biol. Chem. 264: 393-401).

Although α2M may be largely confined to the vasculature in healthy
uninflamed tissues, the inhibitor and its proteinase complexes are found at
near plasma levels in inflammatory exudates of rheumatoid joints and gingival
crevicular fluids (Tollefsen, T. and Saltved, E. (1980) J. Periodont. Res.
15: 96-106; Borth, W., et al., (1983) Ann. N. Y. Acad. Sci. 421: 377-381).

While plasma α2M appears to be synthesized in the liver (Schreiber,
G. (1987) in: "The Plasma Proteins" (Putnam, F.W., ed.) 2nd Ed., 5: 294-363,
Academic Press, Orlando, FL), other sites of synthesis exist. Several cell
strains in culture have been shown to produce α2M, including fibroblasts
(Mosher, D.F., et al., (1977) J. Clin. Invest. 60: 1036-1045) and
monocytes/macrophages (Hovi, T., et al., (1977) J. Exp. Med. 145: 1580-1589).

Whereas hepatocytes and Kupffer cells of the liver are most
important for clearance of α2M-proteinase complexes in plasma (Davidsen, O.,
et al., (1985) Biochim. Biophys. Acta 846: 85-92), fibroblasts (Van Leuven,
F., et al., (1979) J. Biol. Chem. 254: 5155-5160; Mosher, D.F. and Vaheri,
A. (1980) Biochim. Biophys. Acta 627: 113-122) and macrophages (Debanne,
M.T., et al., (1975) Biochim. Biophys. Acta 411: 295-304; Kaplan, J. and
Nielsen, M.L. (1979) J. Biol. Chem. 254: 7323-7328) also possess receptors
for α2M-proteinase complexes.

These observations suggest that there may be a considerable
extravascular turnover of α2M, perhaps primarily carrying proteinases
functioning in the cellular microenvironment (Sottrup-Jensen, L. and
Birkedal-Hansen, H. (1989) J. Biol. Chem. 264: 393-401).

SUMMARY OF THE INVENTION 

Briefly stated, the present invention discloses a method for the
production of recombinant α-macroglobulins, and especially human α2M, and
variants thereof in an active form.

Within a preferred embodiment, the cultured host cell is a
eukaryotic cell such as a mammalian cell or cells derived from organisms
such as insects, plants, yeast or other fungi, such as Aspergillus.

The invention further relates to DNA sequences comprising a gene
encoding for the expression of human α2M and variants thereof, vectors
comprising such DNA sequences, and suitable hosts transformed with such
vectors.

Yet another aspect of the invention is the use of recombinant
α2M and variants thereof as a protein carrier in enzyme replacement therapy
(ERT).

Yet another aspect of the invention is the use of recombinant
α2M and variants thereof as a DNA carrier in gene therapy.

Further aspects of the invention relate to the use of recombinant
α-macroglobulins, especially human α2M, and variants thereof as
constituents of growth media, either as an additive or co-expressed with a
desired gene product.

DEFINITIONS 

Prior to setting forth the invention it may be helpful for an
understanding thereof to set forth definitions of certain terms to be used
hereafter.

Complementary DNA or cDNA: A DNA molecule or sequence which has been
enzymatically synthesized from sequences present in an mRNA template.

DNA Construct: A DNA molecule, or a clone of such a molecule, either single-
or double-stranded, which may be isolated in partial form from a naturally
occurring gene or which has been modified to contain segments of DNA which
are combined and juxtaposed in a manner which would not otherwise exist in
nature.

Plasmid or Vector: A DNA construct containing genetic information which may
provide for its replication when inserted into a host cell. A plasmid
generally contains at least one gene sequence to be expressed in the host
cell, as well as sequences encoding functions which facilitate such gene
expression, including promoters and transcription initiation sites. It may
be a linear or closed circular molecule.

Joined: DNA sequences are said to be joined when the 5' and 3' ends of one
sequence are attached by phosphodiester bonds to the 3' and 5' ends,
respectively, of an adjacent sequence. Joining may be achieved by such
methods as ligation of blunt or cohesive termini, by synthesis of joined
sequences through cDNA cloning, or by removal of intervening sequences
through a process of directed mutagenesis.

Variant: A peptide related to the original peptide, but wherein the amino 
acid sequence has been altered through mutation of the gene encoding the 
original peptide. 

ABBREVIATIONS

AMINO ACIDS

A    Ala    Alanine
V    Val    Valine
L    Leu    Leucine
I    Ile    Isoleucine
P    Pro    Proline
F    Phe    Phenylalanine
W    Trp    Tryptophan
M    Met    Methionine
G    Gly    Glycine
S    Ser    Serine
T    Thr    Threonine
C    Cys    Cysteine
Y    Tyr    Tyrosine
N    Asn    Asparagine
Q    Gln    Glutamine
D    Asp    Aspartic Acid
E    Glu    Glutamic Acid
K    Lys    Lysine
R    Arg    Arginine
H    His    Histidine

NUCLEIC ACID BASES

A = Adenine
G = Guanine
C = Cytosine
T = Thymine (only in DNA)
U = Uracil (only in RNA)
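For readers who handle the sequence listings programmatically, the abbreviations above can be held in a small lookup table. The following Python sketch is illustrative only; the helper names `to_three_letter` and `transcribe` are assumptions introduced here and are not part of the specification.

```python
# One-letter to three-letter amino acid codes, as tabulated above.
ONE_TO_THREE = {
    "A": "Ala", "V": "Val", "L": "Leu", "I": "Ile", "P": "Pro",
    "F": "Phe", "W": "Trp", "M": "Met", "G": "Gly", "S": "Ser",
    "T": "Thr", "C": "Cys", "Y": "Tyr", "N": "Asn", "Q": "Gln",
    "D": "Asp", "E": "Glu", "K": "Lys", "R": "Arg", "H": "His",
}


def to_three_letter(peptide):
    """Spell out a one-letter peptide string in three-letter codes."""
    return "-".join(ONE_TO_THREE[residue] for residue in peptide.upper())


def transcribe(dna):
    """Write a DNA sequence as RNA of the same sense (T only in DNA, U only in RNA)."""
    return dna.upper().replace("T", "U")


print(to_three_letter("MKG"))  # Met-Lys-Gly
print(transcribe("ATGC"))      # AUGC
```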



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1a illustrates the construction of plasmid p1136.

Figure 1b illustrates the construction of plasmid p1167.

Figure 2 illustrates the structure of plasmid p1167.

Figure 3 illustrates a gel electrophoresis (10 - 20 % SDS-PAGE)
of the thermal fragmentation products generated from α2M and rα2M.

Figure 4 illustrates a gel electrophoresis of the thermal
fragmentation products generated from methylamine-treated α2M and rα2M.

Figure 5 illustrates a gel electrophoresis (SDS-PAGE) of the
reaction products generated from trypsin treatment of α2M and rα2M.

Figure 6 illustrates a gel electrophoresis of the reaction products
generated from trypsin treatment of methylamine-treated α2M and rα2M.

Figure 7 illustrates a "rate gel" electrophoresis of unreacted
native and trypsin-treated α2M and rα2M.

Figure 8 illustrates a "rate gel" electrophoresis of unreacted
native and methylamine-treated α2M and rα2M.

Figure 9 illustrates the chromatograms of α2M and rα2M on a
Superose 6 column.

Figure 10 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from chymotrypsin-treated human α2M, human
PZP and rα2M-PZP.

Figure 11 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from elastase-treated human α2M, human
PZP and rα2M-PZP.

Figure 12 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from trypsin-treated human α2M, human PZP
and rα2M-PZP.

Figure 13 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from Staphylococcus aureus Glu-specific
protease-treated human α2M, human PZP and rα2M-PZP.



DETAILED DESCRIPTION OF THE INVENTION

According to the invention there is provided a process for the
production of α-macroglobulins, especially human α2-macroglobulin, or
fragments or derivatives, including variants thereof, wherein a functionally
operative expression vector comprising a gene encoding for the expression of
an α-macroglobulin, especially human α2-macroglobulin, or fragments or
derivatives thereof, including variants, or alleles of such a gene, is
introduced into a suitable host capable of expressing said gene, said host is
cultured in a suitable nutrient medium containing sources of assimilable
carbon and nitrogen and other essential nutrients, and the expressed
α-macroglobulin, especially human α2-macroglobulin, or fragments or derivatives
thereof is recovered.

Many proteins synthesized particularly in mammalian cells undergo 
post-translational modification (processing) of one kind or the other. 






WO 91/03557 



PCT/DK90/00225 




Depending on the final destination and on the specific function of a newly
synthesized protein, it may go through a number of processing steps leading
to covalent modifications such as glycosylation, γ-carboxylation,
β-hydroxylation, sulphatation, amidation, thiol ester formation,
phosphorylation, proteolytic cleavage at precursor processing sites, and fatty
acylation (Rosner, M.R. (1986) in: "Mammalian Cell Technology" (Thilly, W.G.,
ed.), Butterworth Publishers, Stoneham, MA: 63-89).

Proteins of various sizes and with a variety of different
post-translational modifications have been successfully expressed in transformed
heterologous mammalian host cells using recombinant DNA technology. A few
examples: Human coagulation factors VIIa and IX have been expressed in
transformed BHK (Syrian Baby Hamster Kidney) cells with correct post-translational
modifications such as γ-carboxylation and glycosylation (Thim, L. et al.,
(1988) Biochemistry 27: 7785-7793; Busby, S. et al., (1985) Nature 316:
271-273). Human Platelet-derived Growth Factor AB heterodimer has been
expressed in transformed CHO (Chinese Hamster Ovary) cells with correct processing
of the A and B chain precursors and correct assembly of the AB heterodimer.
Human coagulation factor VIII has been expressed in transformed CHO cells
with correct processing of the precursor leading to a two-chain molecule that
can be activated by thrombin and factor Xa (Kaufman, R.J. et al., (1988) J.
Biol. Chem. 263: 6352-6362; Pittman, D.D. and Kaufman, R.J. (1988) Proc.
Natl. Acad. Sci. USA 85: 2429-2433).

So far, there have been no reports on the heterologous expression
of proteins in which the formation of an active thiol ester is a prominent
post-translational modification.

The biosynthesis of the internal thiol ester in the third
component (C3) of complement from rabbit has been investigated (Iijima, M. et
al., (1984) J. Biochem. 96: 1539-1546). Rabbit liver mRNA was translated in
vitro in a rabbit reticulocyte lysate system, and the synthesized C3-specific
products did not incorporate radiolabelled methylamine. On the other hand,
radiolabelled iodoacetamide reacted with the synthesized C3-specific
products; these results indicated the presence in the primary C3-specific
translation product of a free thiol group instead of a reactive thiol ester.
If a liver homogenate supernatant (S-13) including cytosol and microsomes was
included, the C3-specific product could now incorporate methylamine. By
increasing the concentration of the S-13 component(s), the incorporation of
methylamine in C3-specific products was increased, and at the same time
incorporation of iodoacetamide decreased. If the S-13 fraction was treated
at 65°C for 5 min, the activity was completely lost.
The results from this investigation strongly suggest an involvement
of a transglutaminase-like, or other type of, enzyme in the post-translational
formation of an active thiol ester in rabbit C3. There are no similar
investigations addressing the formation of the thiol ester in other
α-macroglobulins, e.g. α2M, but from analogy and homology considerations, it is
expected that a similar mechanism is responsible for the formation of thiol
esters in other α-macroglobulins synthesized in the mammalian liver.

In the course of this investigation a number of developments were made
which are also deemed to be encompassed by the present invention. These
include DNA sequences comprising a gene encoding for the expression of
α-macroglobulins, especially human α2-macroglobulin, or fragments or
derivatives and variants thereof, as exemplified in SEQ ID NO:1 and SEQ ID NO:3.

Another aspect of the invention relates to functionally operative
expression vectors comprising a gene encoding for the expression of at least
one α-macroglobulin, especially human α2-macroglobulin or fragments or
derivatives and variants thereof, or alleles of such a gene.

Such vectors preferably further comprise regulatory elements
necessary for the stable maintenance of said vector in mammalian cells.
Also, such vectors may further include sequences providing for
the processing and secretion of the expressed product.

In relation to the use of recombinant α-macroglobulins, and
especially rα2M, in growth media, it may be co-expressed with another desired
gene product, and consequently the vectors of the invention may further
comprise one or more other genes encoding for a desired gene product.

The invention further relates to transformed hosts comprising a
functionally operative expression vector according to the invention
comprising a gene encoding for the expression of human α2-macroglobulin or fragments
or derivatives and variants thereof, or alleles of such a gene.

The host may be selected from the group comprising a bacterial
strain, a fungal strain, a mammalian cell line, or a mammal, especially a
fungus, such as one belonging to the genus Aspergillus, or a yeast strain,
preferably belonging to the genus Saccharomyces.

Another preferred type of host is a mammalian cell line,
preferably a Syrian Baby Hamster Kidney (BHK) cell line, and especially the
one which is available from ATCC under No. CRL 1632.

The invention further relates to the recombinant human
α2-macroglobulin or a variant thereof in an active form having the amino acid
sequence of SEQ ID NO:2, or SEQ ID NO:4.

APPLICATIONS OF α-MACROGLOBULINS, ESPECIALLY rα2M.

The present invention discloses applications of α-macroglobulins,
and especially rα2M. These should be regarded not as limitations but as a
few examples among many for the use of recombinantly derived α-macroglobulins.

α-MACROGLOBULINS AS CONSTITUENTS OF DEFINED GROWTH MEDIA.

Degradation of specific heterologous products produced in either
transformed or non-transformed mammalian cells is a potential problem in the
production of recombinant products. This is due to the fact that many host
cells secrete one or more different proteinases.

When a production cell line is grown in the presence of e.g. 10 %
fetal calf serum, such proteolytic degradation of secreted recombinant or
native protein products is a minor problem due to a buffering effect of the
added serum proteins.

However, the use of fetal calf serum in the large scale growth
(fermentation) of mammalian production cell lines is not a desirable
situation for a number of reasons. First, fetal calf serum is a very
costly constituent of complex growth media; second, the demand for fetal
calf serum from a growing biopharmaceutical industry might not be easily
fulfilled in the future; and third, the use of fetal calf serum constitutes
a potential quality control problem in the production of pharmaceuticals
intended for use in humans.

To circumvent these problems, efforts can be expected in the 
field of development of defined growth media for use with mammalian cells. 

Addition of various proteinase inhibitors to such new defined
growth media will be required to ensure the integrity of the secreted
products. Alternatively, the producer cell line might, through genetic
engineering, be endowed with the capacity to produce and secrete proteinase
inhibitors along with the desired product(s).

α-Macroglobulins, and especially human α2M, are proteinase
inhibitors of broad specificity, and they are therefore according to the
invention used as constituents of defined growth media for mammalian cells,
either as a medium additive or as a product co-produced with the desired
product.
The target sites for a number of different proteinases, e.g.
bovine trypsin, Streptomyces griseus trypsin, papain, porcine elastase,
bovine chymosin, bovine chymotrypsin, Staphylococcus aureus strain V8
proteinase, human plasmin, bovine thrombin, thermolysin, subtilisin Novo and
Streptomyces griseus proteinase B, have been mapped in the bait region of
human α2M (Mortensen, S.B., et al., (1981) FEBS Lett. 135: 295-300) and other
α-macroglobulins (Sottrup-Jensen, L., Sand, O., Kristensen, L. and Fey, G.H.
J. Biol. Chem. 264, 15781-15789, 1989). It is evident that α2M and the other
α-macroglobulins as proteinase inhibitors have broad specificities.
In those situations where the proteinase inhibitory spectrum
of an α-macroglobulin, such as α2M, is not sufficient for the prevention of
product degradation, it is possible through site-specific mutation, protein
engineering, etc. to change the proteinase inhibitor specificity of the
α-macroglobulin, such as α2M. Incorporation of desirable specific proteinase
target sites in the bait region of recombinant α2M will change the inhibitor
specificity of the mutated α2M. Furthermore, it is possible through genetic
engineering to construct novel specific or general proteinase target sites
in the bait region of an α-macroglobulin in order to enhance its versatility
as a proteinase inhibitor of specific or broad inhibitory spectrum.
Furthermore, it is possible to remove specific target sites in an
α-macroglobulin in order to avoid degradation of the variant in question by
certain proteases in the circulation that will already be inhibited through
the action of naturally present proteinase inhibitors.
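The idea of adding or removing target sites can be illustrated with a toy scan for potential trypsin-like cleavage sites. This is a simplified sketch, not the mapping method used in the cited studies, where cleavage sites were determined experimentally. It assumes the common rule of thumb that trypsin cleaves after Lys (K) or Arg (R) unless the next residue is Pro (P), and the peptide used is a hypothetical placeholder, not a real bait-region sequence.

```python
def trypsin_like_sites(peptide):
    """Return 1-based positions of K/R residues followed by a non-Pro residue.

    Simplified rule of thumb; real susceptibility also depends on
    accessibility and conformation of the peptide stretch.
    """
    sites = []
    for i in range(len(peptide) - 1):  # the last residue has no following bond
        if peptide[i] in "KR" and peptide[i + 1] != "P":
            sites.append(i + 1)
    return sites


# Hypothetical placeholder peptide, for illustration only.
toy_bait = "GSKRPAVLEK"
print(trypsin_like_sites(toy_bait))

# "Removing" the target site by a Lys -> Ala substitution at position 3.
toy_mutant = toy_bait[:2] + "A" + toy_bait[3:]
print(trypsin_like_sites(toy_mutant))
```

In the toy peptide only the Lys at position 3 qualifies (the Arg at position 4 is followed by Pro, and the C-terminal Lys has no following bond), so the single substitution leaves no predicted site.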

The production of recombinant products in fungi, such as species
and strains of e.g. Aspergillus and Saccharomyces, also meets with potential
problems of product degradation. In some cases it is possible to isolate
proteinase-negative mutants of desirable production strains. This might not
always be the case, and co-expression of α-macroglobulins, such as α2M or
α2M-mutants, together with a desirable product may inhibit proteolysis of the
product in question.

α-MACROGLOBULIN MUTANTS AS SPECIFIC PROTEINASE INHIBITORS.

The amino acid sequence of the bait region of α-macroglobulins
defines the specificity of the α-macroglobulin towards different proteinases.
A comparison of cleavage patterns for different proteinases and bait
region sequences in five mammalian α-macroglobulins has recently been
published (Sottrup-Jensen, L., Sand, O., Kristensen, L. and Fey, G.H. The
α-macroglobulin bait region. Sequence diversity and localization of cleavage
sites for proteinases in five mammalian α-macroglobulins. J. Biol. Chem. 264,
15781-15789, 1989). It has previously been clearly demonstrated that the bait
region in each species of α-macroglobulin is the major determinant of
proteinase inhibitor specificity. The present invention demonstrates the
possibility of modulating the inhibitor specificity of human α2M by
alterations of proteinase target sites in the bait region.

In the present invention it is demonstrated that the bait region
of human α2M (residues 690 to 730 in SEQ ID NO:2) can be mutated at will to
obtain a new proteinase inhibitor profile of this macroglobulin. The example
presented in the present invention describes the construction of a hybrid
macroglobulin. In this hybrid the bait region from human pregnancy zone
protein (PZP) was introduced into human α2M, from which the native bait region
had been removed. The hybrid molecule, which was constructed by the use of
recombinant DNA technology, revealed a proteinase inhibitor profile similar
to the inhibitor profile of PZP.
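At the level of the amino acid sequence, the hybrid described above amounts to replacing one residue range with a donor segment. The Python sketch below shows that bookkeeping only; the residue range 690 to 730 is taken from the text, while the function name, the assumption that numbering is 1-based and inclusive on the sequence passed in, and the toy sequences are illustrative assumptions (the actual construct was made at the DNA level and its sequences are those of the sequence listing).

```python
BAIT_START = 690  # first bait-region residue (from the text, SEQ ID NO:2 numbering)
BAIT_END = 730    # last bait-region residue


def swap_region(host, donor_region, start=BAIT_START, end=BAIT_END):
    """Replace residues start..end (1-based, inclusive) of host with donor_region."""
    if not (1 <= start <= end <= len(host)):
        raise ValueError("region lies outside the host sequence")
    return host[:start - 1] + donor_region + host[end:]


# Toy placeholders, not real alpha2M or PZP sequences.
toy_host = "A" * 800
toy_donor_bait = "G" * 30

hybrid = swap_region(toy_host, toy_donor_bait)
print(len(hybrid))  # 800 - 41 removed + 30 inserted
```

Because the donor segment need not have the same length as the 41 residues removed, residue numbering downstream of the bait region shifts in the hybrid.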

The invention thus demonstrates the possibility to design and
produce proteinase inhibitors with altered and new inhibitor specificities
at will.

This finding is important for the design of new proteinase
inhibitors. Due to the low antigenicity of the bait region in macroglobulins
(Van Leuven, F., Marynen, P., Cassiman, J.-J. and Van den Berghe, H. Mapping
of structure-function relationships in proteins with a panel of monoclonal
antibodies. A study on human alpha-2-macroglobulin. J. Immunol. Methods 111,
39-49, 1988, and Delain, E., Barray, M., Tapon-Bretaudiere, J., Pochon, F.,
Marynen, P., Cassiman, J.-J., Van den Berghe, H. and Van Leuven, F. The
Molecular Organization of Human alpha2-Macroglobulin. An Immunoelectron
microscopic study with monoclonal antibodies. J. Biol. Chem. 263, 2981-2989,
1988) it is now possible, by the use of the technology described in the
present invention, to design non-immunogenic new proteinase inhibitors that
can be used e.g. in the treatment of any disease where aggressive proteinases
constitute a threat to the health of man.

In the present specification the production of α2M variants is
described by the construction of a hybrid macroglobulin. It is clear to the
person skilled in the art that changes also could be obtained through other
genetic engineering methods, such as described in International Publication
No. WO 89/06279 (NOVO INDUSTRI A/S). Also it is clear that other
α-macroglobulins could be employed instead of the human α2M, such as those
mentioned in Sottrup-Jensen, L. et al. (1989), supra.
rα2M AS A PROTEIN CARRIER IN ENZYME REPLACEMENT THERAPY.

A different application of α2M is its use as a carrier of
macromolecules such as proteins and nucleic acids. When α2M reacts with and forms
a complex with a proteinase in solution, α2M may bind other proteins (also
non-proteinase proteins) present in that solution (Salvesen, G.S. et al.,
(1981) Biochem. J. 195: 453-461). In the case of Fabry's disease, which is
an X-chromosome linked disorder of glycosphingolipid metabolism, it has
recently been demonstrated that α2M can function as a carrier in an in vitro
model of enzyme replacement therapy (ERT) (Osada, T., et al., (1987) Biochem.
Biophys. Res. Commun. 142: 100-106). α2M was conjugated to coffee bean
α-galactosidase through the action of trypsin, and the formed complex was
internalized through α2M-receptor specific (Van Leuven, F., et al., (1981) J.
Biol. Chem. 256: 9016-9022) endocytosis and delivered to the lysosomes, which
are the target organelle for α2M-receptor mediated internalization of
α2M-proteinase complexes (Willingham, M.C. and Pastan, I., (1980) Cell 21:
67-77).

Such a scheme in ERT provides a method of internalization to the
lysosome of the enzyme in question and at the same time it might alleviate
potential antigenicity problems arising from the use of heterologous enzymes
in therapy. One limitation in this type of ERT (Osada, T., et al., (1987)
Biochem. Biophys. Res. Commun. 142: 100-106) would be the types of potential
target cells that could be treated by this protocol. Obviously, they would
have to express the α2M-receptor. In a future development of the system, the
possibility might exist to redesign the cell specificity of α2M
internalization by exchanging the receptor binding domain of α2M with other receptor
ligands. Hereby α2M-mutants could be designed to enter any cell type known
to express a specific internalizable receptor.

This type of development would of course require a system for
the production of recombinantly derived α2M. The use of native human α2M as a
carrier in ERT (as described above) is undesirable due to the now well known
risks of the employment of blood-derived products in the treatment of human
disease.

The production of recombinant α2M in accordance with the present
invention alleviates this problem by providing for large scale production
of rα2M.

rα2M AS DNA CARRIER IN GENE THERAPY.

Advances in gene transfer into mammalian cells have opened
the possibility of treating a number of genetic disorders through
gene therapy. A major problem in gene therapy will be the specific targeting
of genes into the appropriate cells within the body (Williamson, B., (1982)
Nature 298: 416-418; Anderson, W.F., (1984) Science 226: 401-409; Parkman,
R., (1986) Science 232: 1373-1378).
It was recently described that a constructed foreign gene
containing the chloramphenicol acetyltransferase (CAT) gene on a bacterial
plasmid could be targeted to the liver of rats by specific receptor-directed
internalization (Wu, G.Y. and Wu, C.H. (1988) J. Biol. Chem. 263:
14621-14624). The DNA carrier consisted of a galactose-terminal
(asialo)glycoprotein, asialoorosomucoid, covalently linked to poly-L-lysine. The
polycation poly-L-lysine can bind DNA in a strong non-covalent and
non-damaging interaction. It was demonstrated that complex-bound DNA was internalized
by cell-surface asialoglycoprotein receptors that are unique to hepatocytes.
The complex was injected intravenously, and upon analysis only the liver
expressed the CAT activity.

In the present invention the use of rα2M as a carrier of DNA in
gene therapy is suggested. Reaction of rα2M with a proteinase such as trypsin
or with methylamine in the presence of covalently closed circular plasmid DNA
is likely to result in partial or total entrapment of DNA within the
complexing α2M molecule. After intravenous injection of such complexes with
exposed receptor binding domains, the complex will be rapidly cleared from
the blood and internalized in specific target cells, such as hepatocytes and
Kupffer cells. Through protein engineering on the receptor binding domain of
rα2M it will be possible to design a DNA carrier specific for other cell
types. The advantage of this system as compared to the above described system
using the asialoglycoprotein receptor is that it will not be necessary to
identify different DNA carrier systems for each new cell type.

EXAMPLES

Materials and methods: 
Microorganisms and cell lines 

E. coli K12 (MC1061) is available from e.g. Stratagene Inc.,
11099 North Torrey Pines Rd., La Jolla, California 92037.

HepG2 (human hepatoblastoma cell line) is freely available from
American Type Culture Collection, under No. HB 8065.

BHK (Syrian Hamster Kidney cell line, thymidine kinase mutant
line tk-ts13 (Waechter and Baserga (1982) Proc. Natl. Acad. Sci. USA 79:
1106-1110)) is freely available from American Type Culture Collection, under
No. CRL 1632.

Plasmids and vectors 

Plasmids pCDVI-PL and pSP62-K2 are available from Dr. Tasuku
Honjo, Faculty of Medicine, Kyoto University, Kyoto 606, Japan. pSP62-K2 was
derived from the plasmid pSP62-PL (available from New England Nuclear/Du
Pont (U.K.) Ltd., Wedgwood Way, Stevenage, Hertfordshire, SG14QN) as
described (Noma et al., (1986) Nature, 319: 640-646). pCDVI-PL was derived
from pcDV1 (Okayama, H. and Berg, P. (1983) Molec. Cell. Biol. 3: 280-289)
as described (Noma et al., (1986) Nature, 319: 640-646).

M13mp18 is available from Pharmacia LKB Biotechnology (catalog
# 27-1552-01) (Norrander, J., Kempe, T. and Messing, J. Gene 26: 101-106,
1983).

M13mp19 is available from e.g. International Biotechnologies,
Inc., P.O. Box 9558, 275 Winchester Avenue, New Haven, Connecticut 06535,
USA.

pDHFR-I is available from Dr. K.L. Berkner, ZymoGenetics Inc.,
4225 Roosevelt Way NE, Seattle, Washington 98105. (The construction of this
plasmid is given in detail in: Berkner, K.L. and Sharp, P.A. (1984) Nucleic
Acids Res. 12: 1925-1941). The molecular cloning of the DHFR cDNA present
in this plasmid, and its sub-cloning in mammalian expression vectors under
the control of adenovirus derived promoters, has previously been described
in detail (Chang, A.C.Y., et al., Nature 275: 617-624 and Kaufman, R.J. and
Sharp, P.A. (1982) Mol. Cell. Biol. 2: 1304-1319). The backbone plasmid in
pDHFR-I is pBR322 (Sutcliffe, J.G. (1979) Cold Spring Harbor Symp. Quant.
Biol. 43: 77-90; Sutcliffe, J.G. (1978) Nucleic Acids Res. 5: 2721-2728).

pUC13 is described in: Vieira, J. and Messing, J.: 1982, Gene 19:
259-268 and available from Pharmacia LKB Biotechnology (catalog # 27-4954-01).

pUC19 is described in: Yanisch-Perron, C. and Messing, J., 1985,
Gene 33: 103-119 and available from Pharmacia LKB Biotechnology (catalog #
27-4951-01).

Growth media
LB-broth:

Mix 227 g Bacto Tryptone, Difco 0123-01,
113.5 g Yeast extract, Difco 0127-01, and
227 g NaCl in a sealable plastic container.

Add 12.5 g mix to 500 ml water in a 1000 ml bottle, shake well and sterilize
in an autoclave.
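
As a quick check of the LB-broth recipe above, the final concentration of each component can be worked out from the dry-mix proportions. The following is a minimal sketch; the variable and function names are illustrative and not part of the original protocol.

```python
# Dry-mix masses taken from the recipe (grams).
mix = {"tryptone": 227.0, "yeast_extract": 113.5, "NaCl": 227.0}

# 12.5 g of dry mix is dissolved in 500 ml (0.5 l) of water.
dose_g = 12.5
volume_l = 0.5


def final_concentrations(mix, dose_g, volume_l):
    """Return the concentration (g/l) of each component in the finished broth."""
    total = sum(mix.values())
    return {name: dose_g * mass / total / volume_l for name, mass in mix.items()}


lb = final_concentrations(mix, dose_g, volume_l)
for name, conc in lb.items():
    print(f"{name}: {conc:.1f} g/l")
```

Under these figures the broth works out to 10 g/l tryptone, 5 g/l yeast extract and 10 g/l NaCl.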

Dulbecco's Modified Eagle Medium is available from e.g. Gibco Ltd.,
P.O. Box 35, Trident House, Renfrew Road, Paisley PA3 4EF, Renfrewshire,
Scotland. Cat. # 042-250 1M (10 × concentrate).

Antibodies 

Anti-α2M A033 and peroxidase-conjugated anti-α2M PE326 were from
DAKOPATTS A/S, Copenhagen, Denmark.

EXAMPLE 1. 

CLONING AND SEQUENCE DETERMINATION OF HUMAN α2M

Preparation of messenger RNA from the human cell line HepG2.

The human hepatoblastoma cell line HepG2 (American Type Culture
Collection No. HB 8065, freely available) was used as a source for mRNA
preparation. HepG2 cells were grown to a total cell number of 15 × 10⁷ in
Dulbecco's Modified Eagle medium containing 10% fetal calf serum and
antibiotics.

Total RNA was isolated by the guanidinium thiocyanate method
(Chirgwin et al., (1979) Biochemistry 18: 5293-5299) and purified by CsCl
gradient centrifugation. A total of 3000 μg RNA was obtained. mRNA was
isolated by use of an oligo(dT)-cellulose column (Aviv & Leder (1972) Proc.
Natl. Acad. Sci. USA 69: 1408-1412). 60 μg of mRNA was obtained after one
cycle of affinity chromatography. After ethanol precipitation, this
preparation of mRNA was resuspended in 10 mM Tris-HCl pH 7.5, 0.1 mM
EDTA-Na2 at a final concentration of 1 μg/μl and stored at -80°C for subsequent
use in the construction of a cDNA library.

Construction of a cDNA library from HepG2 mRNA. 

A cDNA library was constructed in the pCDVI-PL/pSP62-K2 vectors
(Noma et al., (1986) Nature, 319: 640-646. Available from Dr. Tasuku Honjo,
Faculty of Medicine, Kyoto University, Kyoto 606, Japan) by use of the
methods described by Okayama & Berg (Mol. Cell. Biol. 2: 161-170 (1982);
Mol. Cell. Biol. 3: 280-289 (1983)).

E. coli K12 (MC1061) (Casadaban & Cohen (1980) J. Mol. Biol.
138: 179-207) was used for transformation. MC1061 cells were grown in L-broth at
37°C to an OD of 0.5. Twenty ml were centrifuged, and the pellet was resuspended
in 7 ml of ice-cold sterile 0.1 M CaCl2, incubated on ice for 30 minutes,
centrifuged briefly, and finally kept in the cold room overnight.

Ninety-five μl suspension of transformation-competent E. coli
MC1061 were added per 10 μl of cDNA preparation. The mixture was incubated
on ice for 30 minutes, heat-shocked at 43.5°C for 45 seconds, and finally,
after addition of L-broth, incubated at 37°C for 30 minutes.

After resuspension, the cells were plated onto L-broth plates
containing ampicillin (50 μg/ml) and grown for 8 hrs at 37°C. A total of
2.9 × 10⁵ individual colonies could be obtained from this library.

Screening of the HepG2 library for cDNA clones encoding human α2M.

5 × 10⁴ individual colonies were screened by standard colony
hybridization technique using nitrocellulose filters (Maniatis et al., (1982)
Molecular Cloning - A Laboratory Manual, Cold Spring Harbor, New York).
A 20-mer oligonucleotide mixture
5' CC(T/C)TTCAT(G/A)TC(T/C)TC(T/C)TG(T/C)TT 3'
where the notation (X/Y) means that either of the nucleotides X or Y may
be used, complementary to the human α2M mRNA in the region encoding amino
acid residues Lys-Gln-Glu-Asp-Met-Lys-Gly (residues number 493 - 499 in
Sottrup-Jensen et al., J. Biol. Chem. 259: 8318-8327 (1984)) was synthesized
(on a DNA synthesizer from Applied Biosystems, USA), labelled with ³²P (using
T4 polynucleotide kinase and γ-³²P-ATP) to a specific activity of 3 * 10
cpm/pmol oligonucleotide. The labelled oligonucleotides were purified by gel
chromatography and subsequently used in the screening of the cDNA library.
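
The design of the degenerate probe can be checked computationally. The sketch below, with illustrative names that are not from the original text, writes (T/C) as the IUPAC code Y and (G/A) as R, enumerates every oligonucleotide in the mixture, and confirms that the reverse complement of each one encodes Lys-Gln-Glu-Asp-Met-Lys followed by the first two bases of a Gly codon.

```python
from itertools import product

# Probe from the text, with (T/C) written as Y and (G/A) written as R.
PROBE = "CCYTTCATRTCYTCYTGYTT"

IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "Y": "CT", "R": "AG"}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def expand(degenerate):
    """List every concrete sequence represented by a degenerate oligonucleotide."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in degenerate))]


def reverse_complement(seq):
    return "".join(COMPLEMENT[b] for b in reversed(seq))


def translate(seq):
    """Translate complete codons only; trailing bases are ignored."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


probes = expand(PROBE)
targets = [reverse_complement(p) for p in probes]
peptides = {translate(t) for t in targets}

print(len(PROBE), "nt probe,", len(probes), "sequences in the mixture")
print("encoded peptides:", peptides)
print("all targets end in GG (start of a Gly codon):",
      all(t.endswith("GG") for t in targets))
```

Five two-fold degenerate positions give a 32-member mixture, which is why a single 20-mer pool can cover every codon choice for this stretch of the protein.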

The hybridization solution contained 6 × SSC, 5 × Denhardt's
solution, 0.05% SDS (Maniatis et al., (1982) Molecular Cloning - A Laboratory
Manual, Cold Spring Harbor, New York) and 10⁶ cpm/ml of labelled
oligonucleotide mix.

Hybridization was performed for 3 hrs at 45°C. Then the filters
were washed in 6 × SSC, 0.05% SDS at 45°C for 3 × 10 minutes. After
autoradiography the filters were washed under the same conditions, but this time
at 52°C. A colony that still showed hybridization at this temperature was
isolated and the cDNA insert of the corresponding plasmid (designated pα2M)
from this isolate was sequenced (Tabor & Richardson (1987) Proc. Natl. Acad.
Sci. USA 84: 4767-4771). The sequence of the cDNA and the derived encoded
amino acid sequence are shown in the appended sequence listings, SEQ ID
NO:1 and SEQ ID NO:2.

Characterization of pα2M.

pα2M had a cDNA insert of approximately 4.6 kb. Its sequence is
given in Table I above.
The sequence in Table I demonstrates that the entire coding
region of α2M including the signal peptide is found in the insert.
In addition to the coding region, the insert contains sequences
derived from the 5' and 3' untranslated regions of the α2M mRNA molecule.
The amino acid sequence of the human α2M as deduced from the cDNA
in pα2M is in total agreement with the published sequence (Sottrup-Jensen et
al., (1984) J. Biol. Chem. 259: 8318-8327). Codon number 1000 (numbered from
the initiating methionine codon in the signal peptide) was found to be ATC
encoding an isoleucine and not GTC (encoding a valine) as found in a cDNA
synthesized from human liver mRNA (Kan et al., (1985) Proc. Natl. Acad. Sci.
USA 82: 2282-2286). In the α2M cDNA sequence from the HepG2 library we have
further identified ten silent changes as compared to the sequence from the
liver library, see the following Table I:

TABLE I

Codon          Liver          HepG2
413 (Asn)      AAC            AAT
495 (Phe)      TTT            TTC
750 (Gly)      GGG            GGT
796 (Leu)      CTT            CTC
835 (Leu)      CTT            (illegible)
1266 (Ala)     (illegible)    GCA
1296 (Asn)     AAT            AAC
1326 (Thr)     ACC            ACA
1442 (Leu)     CTC            CTG
1460 (Ile)     ATC            ATT



The position of the oligonucleotide mixture used as a hybridization
probe in the colony screenings was from position 1574 to position 1594,
and the position of the reactive thiol ester is from position 2939 to 2953
in SEQ ID NO:1.
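
The codon differences reported in Table I can be checked against the standard genetic code. The sketch below is illustrative only: it covers the rows whose two codons are both clearly legible in the source, plus codon 1000 from the text, and the names used are assumptions rather than anything defined in the original.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# (codon number, liver codon, HepG2 codon) for the clearly legible Table I rows.
TABLE_I = [
    (750, "GGG", "GGT"),
    (796, "CTT", "CTC"),
    (1296, "AAT", "AAC"),
    (1326, "ACC", "ACA"),
    (1442, "CTC", "CTG"),
    (1460, "ATC", "ATT"),
]


def is_silent(codon_a, codon_b):
    """True when two different codons encode the same amino acid."""
    return codon_a != codon_b and CODON_TABLE[codon_a] == CODON_TABLE[codon_b]


silent = {number: is_silent(liver, hepg2) for number, liver, hepg2 in TABLE_I}
for number, liver, hepg2 in TABLE_I:
    print(number, liver, "->", hepg2, CODON_TABLE[liver], "silent:", silent[number])

# Codon 1000 is the one non-silent difference: GTC in liver, ATC in HepG2.
codon_1000 = (CODON_TABLE["GTC"], CODON_TABLE["ATC"])
print("codon 1000:", codon_1000)
```

Each listed pair differs only in the third codon position, which is where most synonymous changes occur.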



EXAMPLE 2. 

Construction of a mammalian expression vector for α2M.

pα2M was digested (fig. 1a) with XbaI and EcoRI, and a 1.2 kb
fragment containing the 5' part of the α2M cDNA together with the multiple
cloning site of pSP62-K2 was isolated on an agarose gel and cloned in an
XbaI/EcoRI digested M13mp19 vector to generate M13mp19A. To facilitate
further subcloning of the α2M cDNA, a unique EcoRV site was introduced in
the 1.2 kb fragment 10 nucleotides 5' to the initiating ATG (methionine)
codon through site directed mutagenesis (Kunkel et al., (1987) Methods
Enzymol. 154: 367-382). In the same mutagenesis experiment, in which the
mutagenic oligonucleotide NOR593:

5' (TTCTTCCCCATGGTGGATATCGAAGGAGCTG) 3'

was used, the 5 nucleotides 5' to the methionine codon were changed to
CCACCATG; this mutation creates a new NcoI site spanning the ATG codon. A
correct mutant M13mp19B was identified through restriction enzyme digestion
and DNA sequencing.
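
The features attributed to NOR593 can be located by inspecting its reverse complement. The sketch below assumes that the oligonucleotide as printed is antisense relative to the coding strand (an inference from where the ATG appears, not a statement in the text); the restriction site sequences are the standard EcoRV and NcoI recognition sequences.

```python
NOR593 = "TTCTTCCCCATGGTGGATATCGAAGGAGCTG"

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}
ECORV = "GATATC"   # EcoRV recognition sequence
NCOI = "CCATGG"    # NcoI recognition sequence


def reverse_complement(seq):
    return "".join(COMPLEMENT[b] for b in reversed(seq))


# Assumed sense-strand reading of the mutagenised region.
sense = reverse_complement(NOR593)

ecorv_start = sense.find(ECORV)
ncoi_start = sense.find(NCOI)
atg_start = sense.find("ATG")

print("sense strand:", sense)
print("EcoRV site starts", atg_start - ecorv_start, "nt 5' of the ATG")
print("NcoI site spans the ATG:", ncoi_start < atg_start < ncoi_start + len(NCOI))
print("context of the start codon:", sense[atg_start - 5:atg_start + 3])
```

The EcoRV site beginning 10 nucleotides upstream of the ATG and the CCACCATG context both agree with the description in the text.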

The mutated 5' end of α2M cDNA was isolated from M13mp19A replicative
form through digestion with HindIII and EcoRI and agarose gel electrophoresis.
The isolated DNA fragment was then joined to HindIII/EcoRI digested
pα2M through ligation to generate p1136. In this plasmid the α2M cDNA is
reassembled in its total length, but now with a unique EcoRV site at the 5'
end. p1136 was digested with EcoRV/DraI, and the α2M fragment was isolated on
an agarose gel and cloned in a mammalian expression vector under control of
the adenovirus 2 major late promoter (Ad2 MLP).

The adenovirus-promoter based vector was constructed by K.L. Berkner
(ZymoGenetics Inc., Seattle, WA.), and a detailed description of the
functional elements in the mammalian expression vector is given in: Powell,
J.S. et al., (1986) Proc. Natl. Acad. Sci. USA 83: 6465-6469 and in: Boel
et al., (1987) FEBS Lett. 219: 181-188.

The expression vector used for expression of human α2M was
generated from the mammalian expression vector pPP (Boel, E. et al., (1987)
FEBS Lett. 219: 181-188), in which human pancreatic polypeptide cDNA was
cloned under control of Ad2 MLP.

pPP was digested (fig. 1b) with BamHI and the resulting staggered
ends were repaired with DNA polymerase (Klenow fragment and the four
deoxynucleotide triphosphates). The 4.5 kb EcoRV/DraI α2M cDNA fragment was
joined to this vector through ligation, and correct recombinants were
characterized through restriction enzyme analysis on isolated miniprep
plasmids.

The α2M mRNA transcribed from the resulting 8.76 kb plasmid
(designated p1167 (fig. 2)) has the adenovirus 2 late tripartite leader (L1-3)
at its 5' end together with an mRNA splice signal (SS). At the 3' end of
the construct the transcript is terminated with the SV40 late termination
and polyadenylation signal. 5' to the Ad2 MLP the construct includes the
SV40 enhancer (ENH) and the 0 to 1 (0 - 1) map units from adenovirus 5.

Expression of α2M in mammalian cells.

For expression of human α2M in cultured BHK cells (Syrian Hamster
Kidney, thymidine kinase mutant line tk⁻ ts13 (Waechter and Baserga (1982)
Proc. Natl. Acad. Sci. USA 79: 1106-1110); American Type Culture Collection
CRL 1632) the expression vector p1167 was co-transfected with pDHFR-I (Berkner,
K.L. and Sharp, P.A. (1984) Nucleic Acids Res. 12: 1925-1941. Available
from K.L. Berkner, ZymoGenetics Inc., Seattle) into subconfluent cells by the
calcium phosphate mediated transfection procedure (Graham and Van der Eb
(1973) Virology 52: 456-457). In the transfection experiment the molar ratio
between p1167 and pDHFR-I was 10:1. Cells were grown in Dulbecco's Modified
Eagle Medium supplemented with 10% fetal calf serum (FCS).
Forty-eight hours after transfection, cells were trypsinized and
diluted into medium containing 400 nM methotrexate (MTX). After 10 to 12
days individual colonies were cloned out and expanded separately. The
expanded cultures were propagated for 24 hours as described above, and
producer clones were identified using an enzyme-linked immunosorbent assay
(ELISA) (Munck Petersen, C., et al., (1985) Scand. J. Clin. Lab. Invest. 45:
735-740) against human α2M secreted to the growth medium.

Description of the α2M ELISA assay.

The materials used in the ELISA were:

Catching antibody A033 anti-α2M,
Peroxidase-conjugated anti-α2M antibody PE326,
1,2-Phenylenediamine dihydrochloride (OPD),
all from DAKOPATTS A/S, Copenhagen, Denmark.
Urea peroxide, 125 mg, was from Organon Teknika.
96 well ELISA plates were from NUNC, Copenhagen.

Coating buffer: 

100 mM carbonate buffer pH 9.6 was made up as follows: 
Add 3.18 g Na2CO3 and 5.96 g NaHCO3 to 1000 ml water.
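
The stated 100 mM strength of the coating buffer can be verified from the weighed amounts. This is a minimal sketch using standard molar masses for the anhydrous salts (an assumption, since the text does not state the hydration state); the names are illustrative.

```python
# Molar masses in g/mol for the anhydrous salts (assumed form).
MOLAR_MASS = {"Na2CO3": 105.99, "NaHCO3": 84.01}

# Masses from the recipe (g), dissolved in 1000 ml of water.
recipe_g = {"Na2CO3": 3.18, "NaHCO3": 5.96}
volume_l = 1.0

millimolar = {
    salt: 1000.0 * mass / MOLAR_MASS[salt] / volume_l
    for salt, mass in recipe_g.items()
}
total_mM = sum(millimolar.values())

for salt, conc in millimolar.items():
    print(f"{salt}: {conc:.1f} mM")
print(f"total carbonate species: {total_mM:.1f} mM")
```

The two salts contribute roughly 30 mM and 71 mM, which together match the nominal 100 mM carbonate buffer.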

Standard and sample buffer:

To 100 ml of 150 mM phosphate buffer pH 7.2 was added:
50 μl Tween 20
2 g Bovine Serum Albumin (Sigma A 7030).

Washing buffer:

10 mM sodium phosphate pH 7.4
145 mM sodium chloride
0.1 % Tween 20.

Citric acid-phosphate buffer, pH 4.9:

The following reagents were added to 1000 ml of water:
7.3 g citric acid
23.88 g Na2HPO4 · 12 H2O
0.5 ml Tween 20

The buffer was used for a maximum of 14 days, stored at 4°C.

Urea peroxide solution:
125 mg urea peroxide was dissolved in 8.93 ml water.
The solution was kept in the dark at 4°C.

Coating of the plates for assay: 

The 96 well plate was coated with 175 μl of the DAKO A033
antibody diluted 1:1000 in the coating buffer. The plate was incubated
overnight at 4°C. Before use the plate was washed 4 times in washing buffer.

Application of standards and samples: 

100 μl standard or sample was added to each well. As a standard,
purified human α2M, 2 mg/ml (prepared as described in: Sottrup-Jensen et al.,
(1983) Ann. N.Y. Acad. Sci. 421: 41-60) was used. The standard curve included
the following serial dilutions: 1:4000, 1:8000, 1:16000 etc. down to
1:1024000, corresponding to final concentrations from 500 μg/l down to 1.95
μg/l. All dilutions were done in the Standard and sample buffer. The plate
was incubated overnight at 4°C and then washed 4 times with wash buffer
before the next step.
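
The standard curve concentrations follow directly from the 2 mg/ml stock and the two-fold dilution series. A small sketch, with illustrative names, reproduces the quoted end points:

```python
STOCK_UG_PER_L = 2.0 * 1_000_000  # 2 mg/ml expressed in micrograms per litre

# Two-fold serial dilutions from 1:4000 down to 1:1024000.
dilutions = [4000 * 2 ** step for step in range(9)]
standards = [(d, STOCK_UG_PER_L / d) for d in dilutions]

for dilution, conc in standards:
    print(f"1:{dilution:<8d} {conc:8.2f} ug/l")
```

Nine standards result, spanning 500 μg/l at 1:4000 to about 1.95 μg/l at 1:1024000.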

Addition of conjugated antibody: 

100 μl of PE326, which had been diluted 1:6000 in the Standard
and sample buffer, was added to each well. The plate was incubated for 2 h
at 20°C, and then washed 4 times with wash buffer.

Enzyme activation: 

8 mg of OPD was dissolved in 12 ml of Citric acid-phosphate
buffer. To this solution 500 μl Urea peroxide solution was added and the
mixture was used immediately. 100 μl of the final solution was added to each
well, and the plate was incubated in the dark for 6 min. Then 100 μl of 2 M
H2SO4 was added to each well and the absorbance was read in an automated ELISA
plate reader.

The above described ELISA did not give any background on medium
supplemented with 10% FCS, nor did it give any background in BHK cell
conditioned medium. Of 24 isolated MTX resistant clones, 16 produced
detectable amounts of recombinant α2M.
Selected cell lines that secreted 12.3 mg/l (K16-6) and 19.1
mg/l (K17-6) in the supernatant (grown in a 6 well NUNC-plate) over a 48
hour period were expanded for large scale production of recombinant human
α2M (rα2M).

Purification of recombinant human α2M.

Cell lines K16-6 and K17-6 were each expanded into one ten-double
tray (NUNC, Denmark) with a growth surface of 6000 cm². At 80%
confluency the medium on the cells was changed from containing 10% fetal
calf serum (FCS) down to 2%. After 48 hours of growth in medium with only 2%
FCS, the medium was removed, and the cells were washed twice with serum
free medium. Cells were then grown serum free for 4 to 5 days with change of
serum free medium every two days. Conditioned medium was pooled and analyzed
for rα2M by ELISA.

The pooled conditioned medium from K16-6 and from K17-6 contained
7.15 mg/l and 21.5 mg/l of rα2M, respectively.

The rα2M was purified according to published procedures (Sottrup-Jensen
et al., (1983) Ann. N.Y. Acad. Sci. 421: 41-60). Briefly, the
conditioned medium was loaded onto a 10 ml Zn-Chelate column (Zn-iminodiacetic
acid Sepharose 4B (Porath, J. et al., (1975) Nature 258: 598-599))
equilibrated with 25 mM Tris-HCl pH 8.0, and washed with 100 ml
phosphate buffered saline (PBS) pH 7.2 until A280 < 0.036. A second wash with
20 mM sodium phosphate, 500 mM NaCl pH 6.2 was performed until A280 < 0.033.
The flow rate was 100 ml/hr and 3 ml fractions were collected. rα2M was eluted
with 100 mM EDTA pH 7.0 at a flow rate of 40 ml/hr. During elution 1 ml
fractions were collected.

Recovery of rα2M was 44%. The rα2M containing fractions were concentrated
to 1 ml on an Amicon device equipped with a PM 10 membrane and
then loaded onto a Superose 12 gel filtration column (25 mM Tris-HCl, 150 mM
NaCl pH 8.0). The rα2M containing fractions were pooled and stored at -20°C
until analysis.



EXAMPLE 3.

Characterization of recombinant human rα2M.

A. Chemical reaction at the thiol ester: thermal fragmentation and
methylamine induced cleavage.
A number of different analyses were performed to evaluate the
structural and biological characteristics of the human rα2M as compared to
a preparation of human plasma derived α2M, designated preparation LSJ39.

An important structural feature of α2M is the presence of the
thiol ester. When heated to 95°C for 15 min, the thiol ester will induce a
peptide bond cleavage in the backbone of α2M at the position of the thiol
esterified Glx-residue. This results in the fragmentation of the 180 kD
monomer into two polypeptides of 120 kD and 60 kD. Fig. 3 shows an analysis
of both the purified rα2M (from two transformed BHK cell lines) and the
purified human plasma derived preparation LSJ39 on a 10-20% SDS polyacrylamide
gel. The different preparations, either native human or BHK cell
derived recombinant α2M, were all heat treated to induce thermal fragmentation
before loading onto the gel. Molecular weight markers (from top to
bottom: 180, 120, 92, 60, 43, 26, 14 and 6 kD) were applied to lanes 1 and
8. Samples in lanes 2, 3 and 4 were not reduced before electrophoresis, while
samples in lanes 5, 6 and 7 were reduced. Preparation LSJ39 was applied to
lanes 2 and 5. rα2M K16-6 was applied to lanes 3 and 6, and rα2M K17-6 was
applied to lanes 4 and 7.

It was clear from the patterns of protein fragments on the gel
that both human and the two rα2M preparations showed a considerable degree
of thermal fragmentation. As expected, only the reduced samples displayed
this fragmentation. In the nonreduced samples, the molecules migrated as the
360 kD dimer.

In the human plasma derived preparation LSJ39 (lane 5) a fragment
migrating slightly faster than the 60 kD fragment could be observed. Lanes
6 and 7 indicated the presence in the recombinant material of a similar
faster migrating fragment. It is possible that this fragment represented a
slightly underglycosylated variant of the 60 kD fragment.

Methylamine (MA) and other small nitrogen containing nucleophiles
will cleave the thiol ester and thereby inactivate the ester (Sottrup-Jensen,
L., et al., (1980) FEBS Lett. 121: 275-280; Salvesen, G.S. et al.,
(1981) Biochem. J. 195: 453-461). After MA induced inactivation of the thiol
ester, thermal fragmentation of α2M can no longer be observed.

Fig. 4 shows an SDS-PAGE run similar to that shown in Fig. 3 (with
respect to loaded samples), in which applied α2M and rα2M had been pretreated
with MA. From this gel it was concluded that the thiol ester of rα2M was just
as susceptible to cleavage with MA as the thiol ester of native α2M. Upon
reduction, MA-treated α2M and rα2M migrated as a single 180 kD monomer species.
Lanes 5 of both Fig. 3 and 4 showed an additional band of
approximately 85 kD. When α2M is cleaved in the bait region by proteinases
present in the blood, it generates two fragments, each with a molecular
weight of 85 kD. The human α2M preparation LSJ39 (purified from serum)
contained these cleavage products, while they could not be detected on this
gel in the two rα2M preparations. This indicated that the material secreted
from the transformed BHK cell lines was largely native uncomplexed α2M. Any
α2M molecules that have reacted with proteinases are inactivated and cannot
form additional complexes with other proteinases. Since the BHK cell
does not produce any proteinases that form complexes with the α2M product,
this cell is therefore well suited for production of recombinant human α2M.

B. Reaction with trypsin.

Reaction with trypsin is a standard way of analyzing the proteinase-complex
formation ability of α2M (Sottrup-Jensen, L. (1987) in: "The Plasma Proteins"
(Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Harpel,
P.C. (1973) J. Exp. Med. 138: 508-521; Harpel, P.C., et al., (1979) J. Biol.
Chem. 254: 8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem.
254: 4452-4456). In this reaction trypsin will cleave at its target site(s)
in the bait region of α2M, and the resulting reduced cleavage products (85 kD)
will migrate as a double band. Under nonreducing conditions the trypsin-α2M
complexes will migrate as high molecular weight products.

Fig. 5 shows the result of such an analysis (performed as
described (Sottrup-Jensen, L. (1987) in: "The Plasma Proteins" (Putnam, F.W.,
ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Harpel, P.C. (1973)
J. Exp. Med. 138: 508-521; Harpel, P.C., et al., (1979) J. Biol. Chem. 254:
8869-8878; Swenson, R.P. and Howard, J.B. (1979) J. Biol. Chem. 254: 4452-
4456)) on the native human α2M preparation LSJ39 (lanes 2 and 5) and on rα2M
from cell lines K16-6 (lanes 3 and 6) and K17-6 (lanes 4 and 7). The samples
in lanes 2, 3 and 4 were not reduced before electrophoresis, while the
samples in lanes 5, 6 and 7 were. Lane 5 shows that almost all of the human
native α2M was cleaved with trypsin, while the two preparations of rα2M were
cleaved with an efficiency of approximately 80% or more. Without reduction
of the complexes no low molecular weight products from the reaction between
trypsin and the native α2M or the BHK cell derived rα2M were seen on the gel.
The 85 kD fragments derived from the recombinant material migrated somewhat
faster than the human standard; as mentioned above the recombinant material
might be slightly underglycosylated.



WO 91/03557 PCT/DK90/00225

When α2M is reacted with methylamine, the thiol ester will be
inactivated, and α2M changes conformation from the "slow" form to the "fast"
form (Sottrup-Jensen, L. (1987) in: The Plasma Proteins (Putnam, F.W., ed.)
2nd Ed., 5: 191-291, Academic Press, Orlando, FL; Van Leuven, F., Cassiman,
J.-J. and Van Den Berghe, H. (1981) J. Biol. Chem. 256: 9016-9022). In this
conformation it can no longer react rapidly with or form complexes with
proteinases such as e.g. trypsin.

Fig. 6 shows the results of a set of experiments that were run
in parallel to the experiments described above and shown in Fig. 5. However,
before reaction with trypsin the native human α2M and the rα2M used in this
experiment had been treated with methylamine (Sottrup-Jensen, L., et al.,
(1980) FEBS Lett. 121: 275-280). Under these conditions both the native α2M
and the rα2M show a marked decrease in reactivity towards trypsin (80% or
more of the α2M and rα2M monomers were migrating as a 180 kD polypeptide).
This indicates that trypsin does not rapidly cleave at the bait region in
methylamine treated human α2M or in BHK cell derived rα2M.

In these types of experiments BHK cell derived rα2M has shown
characteristics similar to those of native human α2M.

C. Trypsin and methylamine induced conformational change in α2M.

As mentioned above the α2M molecule will undergo a conformational
change both through complex formation with proteinases and through methylamine
induced cleavage of the thiol ester. The change in structure results
in an altered mobility on rate gels (Sottrup-Jensen, L. (1987) in: The Plasma
Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic Press, Orlando,
FL; Van Leuven, F., Cassiman, J.-J. and Van Den Berghe, H. (1981) J. Biol.
Chem. 256: 9016-9022); unreacted α2M will migrate as a "slow" form, while
reacted α2M will migrate as a "fast" form.

Fig. 7 and Fig. 8 show these conformational changes, as they
appear after reaction with trypsin and methylamine, respectively (analyzed
on 5-10% rate gels).

Lanes 1 on both gels contain purified human pregnancy zone
protein (PZP) (Sand, O. et al., (1985) J. Biol. Chem. 260: 15723-15735),
which is known to appear in both a dimeric (D) and a tetrameric (T)
configuration.

Lanes 2 on both gels contain unreacted human α2M preparation
LSJ39. Lanes 3 on both gels show the fast migrating form, resulting from
reaction with trypsin and methylamine, respectively. Lanes 4 on both gels
show the unreacted rα2M preparation K16-6, and lanes 5 show the corresponding
fast forms. Lanes 6 on both gels show the unreacted rα2M preparation K17-6,
and lanes 7 show the corresponding fast forms.

It can be concluded that both complex formation between rα2M and
trypsin and reaction of rα2M with methylamine result in the appearance of
fast migrating structures. These structures appear (as analyzed on rate gels)
to be very similar to the structures obtained when human α2M was allowed to
react with trypsin and methylamine. It is also evident from these figures
that the rα2M proteins showed a migration which, when compared to the
migration of dimeric and tetrameric PZP on the gels, is in agreement with the
finding that these molecules are produced and secreted from the BHK cells in
the active tetrameric conformation.

D. Chromatography of α2M on a Superose 6 column.

A Superose 6 column can partially resolve α2M molecules in the
dimeric configuration from molecules in the tetrameric configuration
(Sottrup-Jensen, L. unpublished). Human standard α2M and rα2M were analyzed
on a 24 ml Superose 6 column (buffer: 25 mM Tris-HCl, 125 mM NaCl pH 8.0;
flow rate: 1 ml/min; fraction size: 1 ml). Fig. 9 shows the diagrams obtained
from the chromatography of purified human standard α2M and rα2M from the K17-6
and the K16-6 BHK cell lines. Tetrameric α2M (Sottrup-Jensen, unpublished
observation) will elute in fraction 12 on this type of column. It is evident
from the chromatograms that both of the rα2M preparations eluted in fraction
12, as did the human standard α2M. On this type of column, dimeric α2M
molecules will elute in fractions 14 and 15 (Sottrup-Jensen, unpublished
observation). This type of analysis supported the results obtained from the
rate gels (Figs. 7 and 8), that rα2M was secreted from BHK cells in a
tetrameric configuration.

E. Trypsin protection analysis.

When trypsin is trapped inside the α2M molecule, it retains its
catalytic capacity towards low molecular weight substrates such as S-2222
(N-benzoyl-L-Ile-L-Glu-Gly-L-Arg-p-nitroanilide). If trypsin is efficiently
complexed with α2M, it will be protected against high molecular weight
inhibitors such as Soybean Trypsin Inhibitor (STI) (Sottrup-Jensen, L. (1987)
in: The Plasma Proteins (Putnam, F.W., ed.) 2nd Ed., 5: 191-291, Academic
Press, Orlando, FL; Ganrot, P.O. (1966) Clin. Chim. Acta 14: 493-501;
Sottrup-Jensen, L. et al., (1981) FEBS Lett. 128: 127-132).

K16-6 and K17-6 derived rα2M was compared with human plasma α2M
in such a protection assay. 100 μl α2M (in 25 mM Tris-HCl, 125 mM NaCl, pH
8.0) was mixed with 30 μl trypsin (0.5 mg/ml in 20 mM sodium acetate pH 5.0).
After incubating for 2 min, 30 μl 1 mg/ml STI (in PBS) was added. 10 μl
aliquots were removed after 2 and 4 min and each mixed with 750 μl 0.12 mM
S-2222 (dissolved in 0.1 M sodium phosphate pH 8.0, 5% dimethyl sulfoxide).
The change in absorbance at 405 nm was recorded for 2 min. The
results of the assay are given in the following Table II:



TABLE II

Prep. of α2M     ΔA/min     α2M in cuvette (μg)     Activity (ΔA/min/μg)
Human LSJ39      0.140      5.00                    0.028
K16-6            0.111      4.62                    0.024
K17-6            0.119      4.87                    0.024



From these results it can be concluded that rα2M had essentially
the same protection capacity for trypsin against STI as compared with the
protection capacity of human plasma α2M.
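
The specific activities in Table II are simply the measured rate divided by the amount of α2M in the cuvette. The sketch below recomputes them from the tabulated values and expresses the recombinant preparations relative to the plasma standard; the variable names are illustrative.

```python
# (rate in delta-A per min, micrograms of alpha-2-M in the cuvette), from Table II.
table_ii = {
    "Human LSJ39": (0.140, 5.00),
    "K16-6": (0.111, 4.62),
    "K17-6": (0.119, 4.87),
}

specific = {prep: rate / ug for prep, (rate, ug) in table_ii.items()}
relative = {prep: value / specific["Human LSJ39"] for prep, value in specific.items()}

for prep in table_ii:
    print(f"{prep:12s} {specific[prep]:.3f} dA/min/ug "
          f"({100 * relative[prep]:.0f}% of plasma standard)")
```

Rounded to three decimals the recomputed values reproduce the last column of the table.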

If α2M is treated with methylamine before the protection assay,
the protection capacity drops dramatically. In a similar assay as that
described above, methylamine treated human plasma α2M only retained 17% of
its protection capacity, while K16-6 and K17-6 rα2M retained 16% and 14%
respectively. It can be concluded that rα2M protected trypsin against STI
with almost the same efficiency as did human plasma α2M.

F. Amino terminal amino acid sequencing of rα2M.

Theoretically, the α2M characterized in the present investigation
could only be either bovine (contaminant from serum), from hamster
(endogenous product from the BHK cell) or derived from expression of the
transfected plasmid p1167. The ELISA assay used never recognized any α2M in
BHK cell conditioned medium, whether with or without added fetal calf serum.
To make sure that the investigated α2M was human α2M, and to characterize the
amino terminal processing of the recombinant product, amino terminal amino
acid sequence determination was carried out on K16-6 and K17-6 rα2M as
described (Sottrup-Jensen, L. et al., (1984) J. Biol. Chem. 259: 8293-8303).
The Edman degradation was repeated for 12 cycles, and the identity of the
detected amino acid derivative in each cycle was in total agreement with the
amino terminal sequence of human α2M: Ser-Val-Ser-Gly-Lys-Pro-Gln-Tyr-Met-
Val-Leu-Val-, whereas bovine α2M has the following amino terminal sequence:
Ala-Val-Asp-Gly-Lys-Pro-Gln-Tyr-Met-Val-Leu-Val- (unpublished, Dr. Torsten
Kristensen, Department of Molecular Biology, University of Aarhus, Denmark.)
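
The two amino terminal sequences quoted above differ at only two of the twelve sequenced positions, which is what allows Edman degradation to distinguish the human product from a bovine contaminant. A short illustrative comparison (names are not from the original):

```python
human = "Ser-Val-Ser-Gly-Lys-Pro-Gln-Tyr-Met-Val-Leu-Val".split("-")
bovine = "Ala-Val-Asp-Gly-Lys-Pro-Gln-Tyr-Met-Val-Leu-Val".split("-")

# Report 1-based Edman cycle numbers where the residues differ.
differences = [
    (cycle, h, b)
    for cycle, (h, b) in enumerate(zip(human, bovine), start=1)
    if h != b
]

for cycle, h, b in differences:
    print(f"cycle {cycle}: human {h}, bovine {b}")
```

Both discriminating residues fall within the first three cycles, so even a short sequencing run would separate the two species.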

EXAMPLE 4.

Construction and expression of a bait region mutant of human α2M.

In the present example it is demonstrated that the bait region
of human α2M can be substituted by the bait region of human pregnancy zone
protein (PZP) (Sottrup-Jensen, L., Folkersen, J., Kristensen, T. and Tack,
B.F. Partial primary structure of human pregnancy zone protein: extensive
sequence homology with human alpha 2-macroglobulin. Proc. Natl. Acad. Sci.
U.S.A. 81, 7353-7357, 1984; Sand, O., Folkersen, J., Westergaard, J.G. and
Sottrup-Jensen, L. Characterization of human pregnancy zone protein.
Comparison with human alpha 2-macroglobulin. J. Biol. Chem. 260, 15723-15735,
1985). The resulting α2M bait region mutant exhibited a proteinase inhibitor
profile similar to that of human pregnancy zone protein.

To facilitate substitution of DNA fragments encoding the bait
region of human α2M cDNA, target sites for the restriction enzymes PstI and
SacII were introduced at the 5' and at the 3' end of the cDNA region encoding
the bait region.

The human α2M expression plasmid p1167 was digested with BamHI and
ClaI, and a 2660 bp fragment, which carried the central part of the human
α2M cDNA, was subcloned in the BamHI and ClaI digested vector pSX191.
This vector, which had previously been constructed, is a
derivative of pUC19. It was constructed as described: pUC19 was digested
with EcoRI and HindIII, and a synthetic linker with the following sequence

    KpnI  PstI  EcoRI Hind3 ClaI   SphI  BamHI
AATTGGTACCCTGCAGGAATTCAAGCTTATCGATGGCATGCGGATCC     - NOR781
    CCATGGGACGTCCTTAAGTTCGAATAGCTACCGTACGCCTAGGTCGA - NOR782

was cloned in the digested pUC19 vector. The linker, which was an annealing
product from the two synthetic oligonucleotides NOR781 and NOR782, has
cohesive ends that will ligate to the EcoRI and the HindIII sites of pUC19
in such a way that these ligation sites are not regenerated in the pSX191
vector. Thus pSX191 carried sites for KpnI, PstI, EcoRI, HindIII, ClaI, SphI
and BamHI.
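The design of the linker can be checked with a short Python sketch. The two oligonucleotide sequences are copied from the text; the recognition sequences are the standard ones for these enzymes; the four-nucleotide overhang, the function names and the junction reconstruction are assumptions made for illustration only.

```python
# NOR781 is written 5'->3'; NOR782 is written 3'->5', as printed in the text.
NOR781 = "AATTGGTACCCTGCAGGAATTCAAGCTTATCGATGGCATGCGGATCC"
NOR782_3TO5 = "CCATGGGACGTCCTTAAGTTCGAATAGCTACCGTACGCCTAGGTCGA"

COMPLEMENT = str.maketrans("ACGT", "TGCA")
OVERHANG = 4  # assumed length of each single-stranded cohesive end

# Standard recognition sequences of the enzymes named above the linker.
SITES = {
    "KpnI": "GGTACC", "PstI": "CTGCAG", "EcoRI": "GAATTC",
    "HindIII": "AAGCTT", "ClaI": "ATCGAT", "SphI": "GCATGC",
    "BamHI": "GGATCC",
}


def duplex_is_complementary(top, bottom_3to5, overhang=OVERHANG):
    """Check that the paired region of the two strands is base-complementary."""
    paired_top = top[overhang:]
    paired_bottom = bottom_3to5[:len(bottom_3to5) - overhang]
    return paired_top.translate(COMPLEMENT) == paired_bottom


def site_positions(top):
    """1-based start of each recognition sequence on the top strand."""
    return {name: top.find(seq) + 1 for name, seq in SITES.items()}


# Junctions formed with EcoRI/HindIII-cut pUC19 (top strand, illustrative):
# the vector contributes a G before the linker and AGCTT after it.
left_junction = "G" + NOR781[:5]
right_junction = NOR781[-1] + "AGCTT"

if __name__ == "__main__":
    print(duplex_is_complementary(NOR781, NOR782_3TO5))
    print(site_positions(NOR781))
    print(left_junction, right_junction)
```

In this sketch neither junction reproduces GAATTC or AAGCTT, which is consistent with the statement that the ligation sites are not regenerated, while the internal EcoRI and HindIII sites supplied by the linker remain.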

The resulting plasmid pSX191α2M was digested with BamHI and
HindIII, and a purified 2.6 kb BamHI/HindIII α2M fragment was cloned in
M13mp18 to generate M13mp18α2M for mutagenesis by described methods. A
synthetic oligonucleotide NOR973, with the following sequence:

5' (TTCATACTGCTGCAGCTGTGGACAC) 3'
was used to introduce a PstI site at position 2102 (SEQ ID NO:1) in the cDNA
sequence, and an oligonucleotide (NOR974) with the following sequence:

5' (AGCCACCCCCGCGGAGTTTACCAC) 3'
was used to introduce a SacII site at position 2271 (SEQ ID NO:1) in the
cDNA sequence. These sites were chosen because they did not introduce
alterations in the encoded amino acid sequence, and they were within a
convenient distance of the bait region in human α2M cDNA. Both primers were
used in the same mutagenesis experiment (Kunkel, T.A., Roberts, J.D. and
Zakour, R.A. Rapid and Efficient Site-Specific Mutagenesis without Phenotypic
Selection. Methods in Enzymol. 154, 367-382, 1987); dsDNA was isolated from
mutated M13mp18α2M plaques, and the DNA was digested with the restriction
enzymes PstI and SacII. Correctly mutated recombinants, which had an insert
of 160 bp, were further analyzed by DNA sequencing (Tabor, S. and Richardson,
C.C. DNA sequence analysis with a modified bacteriophage T7 DNA polymerase.
Proc. Natl. Acad. Sci. U.S.A. 84, 4767-4771, 1987). A 2.6 kb BamHI/HindIII
fragment from a correct α2M cDNA mutant (M13mp18α2M#212.1) was subcloned in
a BamHI/HindIII digested pUC13 vector, and a correct subclone p1308 was
isolated and characterized with BamHI/HindIII and PstI/SacII double
digestions and DNA electrophoresis.
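That the PstI site introduced by NOR973 leaves the encoded amino acids unchanged can be illustrated with the following Python sketch. The oligonucleotide is taken from the text and the wild-type segment is read from nucleotides 2092 to 2116 of SEQ ID NO:1 as printed in the sequence listing. It is assumed here that the oligonucleotide is complementary to the coding strand; the helper names and the reading-frame offset are illustrative choices, not part of the original method.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

NOR973 = "TTCATACTGCTGCAGCTGTGGACAC"      # mutagenic primer, 5'->3'
WILD_TYPE = "GTGTCCACAGCTTCAACAGTATGAA"   # coding strand, nt 2092-2116
FIRST_POSITION = 2092                     # position of WILD_TYPE[0]
FRAME_OFFSET = 1                          # first full codon (TGT) starts here


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq, offset=FRAME_OFFSET):
    """Translate complete codons of seq starting at the given offset."""
    end = offset + 3 * ((len(seq) - offset) // 3)
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(offset, end, 3))


mutant = reverse_complement(NOR973)
changed = [FIRST_POSITION + i
           for i, (w, m) in enumerate(zip(WILD_TYPE, mutant)) if w != m]
pst_site_start = FIRST_POSITION + mutant.find("CTGCAG")

if __name__ == "__main__":
    print(mutant, changed, pst_site_start)
    print(translate(WILD_TYPE), translate(mutant))
```

With these inputs the mutant strand carries CTGCAG starting at position 2102, differs from the wild type at two nucleotides, and both strands translate to the same peptide, in line with the statement that the new sites do not alter the encoded amino acid sequence.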

The PstI/SacII fragment in p1308 can be excised and replaced
with a different DNA fragment, which encodes bait region variants. The
resulting new variants (bait region mutants or analogs) of α2M cDNA can be
isolated as BamHI/ClaI fragments and subcloned back into BamHI/ClaI digested
expression vector p1167.

In the present example DNA encoding the amino acids of the bait
region for human PZP (Sottrup-Jensen et al. 1989, supra) was obtained from
ligation, annealing and cloning of 8 synthetic oligonucleotides.

The DNA sequence of the synthetic fragment and the encoded amino
acids as inserted into the α2M clone are given in SEQ ID NO:3, and comprise
positions 2107 to 2305 and the corresponding amino acids. A PstI site was
introduced at the 5' end in the synthetic fragment, and SacII and BamHI sites
were introduced at the 3' end.

This synthetic 0.2 kb DNA fragment was cloned in a PstI/BamHI
digested M13mp18 vector for DNA sequencing. DNA from a clone containing the
correct sequence was digested with PstI and SacII, and the purified 0.2 kb
fragment was cloned in a PstI/SacII digested and gel purified p1308 vector.
A correct recombinant, p267PZP, was characterized with restriction enzyme
digestions, and from this plasmid, bait region mutated (α2M-PZP) cDNA was
isolated as a 2.7 kb BamHI/ClaI fragment and subcloned in a BamHI/ClaI
digested expression vector p1167. The resulting plasmid, designated p1365,
was grown as a large scale plasmid preparation, purified by CsCl
centrifugation, and cotransfected with pDHFR-I into BHK cells.

Through this procedure the nucleotides 2102 to 2275 in SEQ ID
NO:1 were removed and replaced with nucleotides 2102 to 2305 in SEQ ID NO:3.
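The size bookkeeping of this replacement can be written out as a small Python sketch. The coordinates come from the sentence above and the 2660 bp figure from the BamHI/ClaI subcloning step earlier in this example; the function name and the assumption that both spans are inclusive are illustrative.

```python
def span_length(first, last):
    """Number of nucleotides in an inclusive coordinate range."""
    return last - first + 1


REMOVED = span_length(2102, 2275)    # wild-type segment of SEQ ID NO:1
INSERTED = span_length(2102, 2305)   # PZP-derived segment of SEQ ID NO:3
NET_CHANGE = INSERTED - REMOVED      # nucleotides gained by the mutant cDNA

# A net change divisible by three keeps the downstream reading frame.
FRAME_PRESERVED = NET_CHANGE % 3 == 0
EXTRA_CODONS = NET_CHANGE // 3

# Expected size of the mutated BamHI/ClaI fragment, assuming the
# wild-type fragment is the 2660 bp fragment described earlier.
MUTANT_FRAGMENT_BP = 2660 + NET_CHANGE

if __name__ == "__main__":
    print(REMOVED, INSERTED, NET_CHANGE, FRAME_PRESERVED, EXTRA_CODONS)
    print(MUTANT_FRAGMENT_BP)
```

On these assumptions the mutant gains 30 nucleotides, or 10 codons, without a frame shift, and the resulting fragment of about 2690 bp agrees with the 2.7 kb BamHI/ClaI fragment reported for the bait region mutant.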

The procedures for transfection, selection of bait region mutated
α2M (designated rα2M-PZP) recombinants (with an α2M specific ELISA), large
scale production and purification of mutated α2M were as described elsewhere
(EXAMPLE 2) in this application.

Characterisation of the proteinase inhibitor specificity of a bait region
mutant of human α2M.

The purified recombinant mutant, rα2M-PZP, was characterized
with respect to its inhibitor specificity profile against various
proteinases by the use of previously described methods (Sand et al. 1985). For
comparison, human plasma derived α2M and PZP were treated with the same set
of proteinases in parallel reactions. The proteinases used were
chymotrypsin, elastase, trypsin and Staphylococcus aureus Glu-specific
proteinase. It has been reported (Sand et al. 1985) that chymotrypsin and
elastase show a rapid reaction with both PZP and α2M, while the reaction
between the two proteinase inhibitors and trypsin and Staphylococcus aureus
Glu-specific proteinase is quite dissimilar for PZP and α2M: both proteinases
react rapidly with α2M, while the reaction with PZP is slow (Sand et al.
1985). The reason for this difference in reaction rate with the different
proteinases is believed to be that the bait region in PZP contains strong
specificity determinants for chymotrypsin and elastase, but none for trypsin
and Staphylococcus aureus Glu-specific proteinase.

The results of the analysis are presented in Figures 10 to 13.
Figure 10 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from chymotrypsin treated human α2M, human
PZP and rα2M-PZP. Molecular weight markers (from top to bottom: 180, 120, 92,
60, 43, 26, 14 and 6 kD) were applied to lanes 1 and 8. All samples were
reduced. Lanes 2, 3 and 4 show the cleavage products obtained from reaction
of chymotrypsin with human plasma derived PZP, rα2M-PZP and human plasma
derived α2M, respectively. The ratio of proteinase to inhibitor was 1:1. Lanes
5, 6 and 7 show cleavage products from similar reactions at a ratio of 2:1
between proteinase and the three tested inhibitors. In all 6 lanes cleavage
products (85 kD) could be identified. This indicated that rα2M-PZP reacted
with chymotrypsin with similar characteristics as did human plasma derived
α2M and PZP.

Figure 11 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from elastase treated human α2M, human
PZP and rα2M-PZP. Molecular weight markers were the same as applied on the
gel in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage
products obtained from reaction of elastase with human plasma derived PZP,
rα2M-PZP and human plasma derived α2M, respectively. The ratio of proteinase
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar
reactions at a ratio of 2:1 between proteinase and the three tested
inhibitors. In all 6 lanes cleavage products (85 kD) could be identified.
This indicated that rα2M-PZP reacted with elastase with similar
characteristics as did human plasma derived α2M and PZP.

Figure 12 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from trypsin treated human α2M, human PZP
and rα2M-PZP. Molecular weight markers were the same as applied on the gel
in Fig. 2. All samples were reduced. Lanes 2, 3 and 4 show the cleavage
products obtained from reaction of trypsin with human plasma derived PZP,
human plasma derived α2M and rα2M-PZP, respectively. The ratio of proteinase
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar
reactions at a ratio of 2:1 between proteinase and the three tested
inhibitors. In lanes 3 and 6 cleavage products (85 kD) could be identified
from the reaction between trypsin and α2M. In lanes 2, 4, 5 and 7 no cleavage
products were observed from the reaction of trypsin with PZP and rα2M-PZP.
This result demonstrated that rα2M-PZP reacted poorly with trypsin as did
human plasma derived PZP, while α2M was cleaved in the reaction with trypsin.

Figure 13 illustrates the gel electrophoresis (10 - 20 % reducing
SDS-PAGE) of the reaction products from Staphylococcus aureus Glu-specific
protease treated human α2M, human PZP and rα2M-PZP. Molecular weight markers
were the same as applied on the gel in Fig. 2. All samples were reduced.
Lanes 2, 3 and 4 show the cleavage products obtained from reaction of
Staphylococcus aureus Glu-specific protease with human plasma derived PZP,
rα2M-PZP and human plasma derived α2M, respectively. The ratio of proteinase
to inhibitor was 1:1. Lanes 5, 6 and 7 show cleavage products from similar
reactions at a ratio of 2:1 between proteinase and the three tested
inhibitors. In lanes 4 and 7 cleavage products (85 kD) could be identified
from the reaction between Staphylococcus aureus Glu-specific protease and
α2M. In lanes 2, 3, 5 and 6 much less cleavage product could be identified
from the reaction of this proteinase with PZP and rα2M-PZP. This result
demonstrated that rα2M-PZP reacted poorly with the Staphylococcus aureus
proteinase as did human plasma derived PZP, while α2M was cleaved in the
reaction with this proteinase.

It can be concluded that rα2M-PZP showed the same pattern of
reaction with four proteinases as did human plasma derived PZP. This pattern
of reaction was different from the corresponding pattern obtained from
reaction with α2M. Thus rα2M-PZP has been demonstrated to have a proteinase
inhibitor profile similar to native PZP and dissimilar to α2M. Thus it has
been demonstrated that the proteinase inhibitor profile of α2M can be
modulated by substitution of DNA fragments encoding the bait region.
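The comparison of reaction patterns can be summarised as a small lookup table in Python. The entries restate the outcomes reported in the text for Figures 10 to 13; the two-level coding ("cleaved" where 85 kD products were identified, "poor" where none or much less product was seen) and the ASCII labels are simplifications chosen for this sketch.

```python
# Outcomes as described in the text for Figures 10 to 13.
OBSERVED = {
    "chymotrypsin": {"PZP": "cleaved", "ra2M-PZP": "cleaved", "a2M": "cleaved"},
    "elastase": {"PZP": "cleaved", "ra2M-PZP": "cleaved", "a2M": "cleaved"},
    "trypsin": {"PZP": "poor", "ra2M-PZP": "poor", "a2M": "cleaved"},
    "S. aureus Glu-specific proteinase": {
        "PZP": "poor", "ra2M-PZP": "poor", "a2M": "cleaved",
    },
}


def profile(inhibitor):
    """Reaction outcome of one inhibitor with each proteinase, in table order."""
    return tuple(OBSERVED[proteinase][inhibitor] for proteinase in OBSERVED)


def same_profile(first, second):
    """True when two inhibitors show the same outcome with every proteinase."""
    return profile(first) == profile(second)


if __name__ == "__main__":
    print(same_profile("ra2M-PZP", "PZP"))
    print(same_profile("ra2M-PZP", "a2M"))
```

Read this way, the mutant matches PZP for all four proteinases and departs from α2M for trypsin and the Staphylococcus aureus proteinase.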

The substitution as described in this invention did not destroy
the activity of the proteinase inhibitor, and it is therefore demonstrated
that functional macroglobulin hybrids can be constructed by substitutions
(mutations) in the bait region. The finding will lead to the design of α2M
derivatives with new desired proteinase specificities. No doubt, these
results could be extended to other macroglobulin based hybrids, in which the
bait region can be modified at will to obtain new inhibitor specificities.
Aggressive activity of proteinases is often a problem in relation
to various diseases (e.g. the activity of elastase and cathepsin G in severe
inflammation leads to tissue and organ destruction and failure). Inhibitors
of such proteinases will be useful in drug design. In situations where the
target site for the proteinase is known, but no inhibitor can be identified,
α2M can be engineered (mutated in the bait region) to obtain the desired
specificity. In a situation where the target specificity of the proteinase
in question is unknown, saturation mutagenesis or random synthesis of the
bait region will lead to an indefinite number of target sequences that can
be introduced and expressed in hybrid macroglobulins. These hybrids can be
screened for proteinase inhibition, and the target sequence(s) can be
identified. The resulting α2M analog can be produced and purified as described
elsewhere in this invention. Upon injection into the circulation such α2M
analogs will inhibit and clear from the blood any proteinase of the given
specificity.
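To give a sense of scale for such a library, the number of possible variants can be estimated as below. This is a purely hypothetical sketch: the number of randomized positions and the use of NNK codons (N = any base, K = G or T) are assumptions introduced for illustration and are not specified in this application.

```python
AMINO_ACIDS = 20          # standard amino acids available at each position
NNK_CODONS = 4 * 4 * 2    # codons covered by an NNK degenerate triplet


def protein_variants(randomized_positions):
    """Distinct peptide sequences when each position can be any amino acid."""
    if randomized_positions < 0:
        raise ValueError("number of positions must be non-negative")
    return AMINO_ACIDS ** randomized_positions


def nnk_dna_variants(randomized_positions):
    """Distinct DNA sequences when each position is an NNK codon."""
    if randomized_positions < 0:
        raise ValueError("number of positions must be non-negative")
    return NNK_CODONS ** randomized_positions


if __name__ == "__main__":
    for n in (1, 2, 4, 6):
        print(n, protein_variants(n), nnk_dna_variants(n))
```

The rapid growth with the number of randomized positions is why a screening step for proteinase inhibition, as described above, would be needed to identify useful target sequences.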

Introduction of protein analogs or mutants in the human body
always raises the possibility of antigenicity. The generation of a panel
of 45 mouse monoclonal antibodies against human α2M has been described (Van
Leuven et al. 1988; Delain et al. 1988). None of these antibodies were directed
against the bait region. This indicates that the bait region is not highly
antigenic and that mutants in this region of the molecule can be generated
and used for therapeutic purposes without risk of antibody development.
SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT: Novo Nordisk A/S 



(ii) TITLE OF INVENTION: Expression of Plasma Glycoprotei 



(iii) NUMBER OF SEQUENCES: 4 




(B) STREET: Novo Alle 

(C) CITY: Bagsvaerd 

(E) COUNTRY: DENMARK 

(F) ZIP: DK-2880 



5: 

Nordisk A/S, Patent Department 




R: DK 4235/89, DK 4236/89, DK 4237/89 



(B) FILING DATE: 29-AUG-1989 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: Hepatic 

(G) CELL TYPE: Hepatoblastoma 

(H) CELL LINE: HepG2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 29.. 4450 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GTCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT 




100 
GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG CTC CAC 148
Val Ser Gly Lys Pro Gln Tyr Met Val Leu Val Pro Ser Leu Leu His
25 30 35 40

ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC TAC CTG AAT GAG 196
Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn Glu
45 50 55

ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG GGA AAC AGG AGC 244 
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 
60 65 70 

CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC CAC TGT GTC GCC 292 
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 
75 80 85 

TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA ATG TTC CTC ACT 340 
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu Thr 
90 95 100 

GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG CGG ACC ACA GTG 388 
Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr Val 
105 110 115 " 120 

ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG ACA GAC AAA TCA 436 
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys Ser 
125 130 135 

ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT 484 
He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met Asp 
140 145 ~ 150 

GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAC ATT CAG 532 
Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr He Gin 
155 160 165 

GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG 580 
Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu Glu 
170 175 180 

GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTC CAG 628 
Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe Gin 
185 190 195 200 

GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT GGA AGG ACA GAG 676 
Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr Glu 
205 210 " 215 

CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG TTT GAA GTA CAA 724 
His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val Gin 
220 225 230 

GTA ACA GTG CCA AAG ATA ATC ACC ATC TTG GAA GAA GAG ATG AAT GTA 772 
Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn Val 
235 240 245 
TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC CCT GGA CAT GTG
Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His Val
250 255 260

ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 
Thr Val Ser Ile Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly
265 270 275 280

GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA CAG CTA AAC AGC 
Glu Asp Ser Gln Ala Phe Cys Glu Lys Phe Ser Gly Gln Leu Asn Ser
285 290 

CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG
His Gly Cys Phe Tyr Gln Gln Val Lys Thr Lys Val Phe Gln Leu Lys
300 305 310

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC CAG ATC CAA GAA 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin Glu 
315 320 325 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He Thr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC TCA CAC TTT CGA 
Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 360 

CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC
Gln Gly Ile Pro Phe Phe Gly Gln Val Arg Leu Val Asp Gly Lys Gly
365 370 375 

GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC 
Val Pro lie Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala Asn 
380 385 390 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 
Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gln Phe Ser
395 400 405

ATP AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 
Ile Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn
410 415 420

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 
Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gln Trp Val Ser Glu Glu
425 430 435 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC TCC CCA AGC AAG 
His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser Lys 
445 450 455 

AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 
Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 
460 465 470 
ACT CAG ACA GTC CAG GCA CAT TAT ATT CTG AAT GGA GGC ACC CTG CTG 1492
Thr Gin Thr Val Gin Ala His Tyr lie Leu Asn Gly Gly Thr Leu Leu 
475 480 485 

GGG CTG AAG AAG CTC TCC TTC TAT TAT CTG ATA ATG GCA AAG GGA GGC 1540 
Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly Gly 
490 495 500 

ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG CAG GAA GAu ATG 1588 
He Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp Met 
505 510 515 520 

AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA GAC ATT GCT CCT 1636 
Lys Gly His Phe Ser lie Ser He Pro Val Lys Ser Asp lie Ala Pro 
525 530 535 

GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC GGG GAC GTG ATT 1684 
Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val He 
540 545 550 

GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG GCC AAC AAG GTG 1732 
Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys Val 
555 560 565 

GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC TCA CAC GCC CAC 1780 
Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 
570 575 580 

CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC CTC CGT GCT GTG 1828 
Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 
585 590 595 600 

GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG CTC TCG GCG TCC 1876 
Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala Ser 
605 610 615 

TCG GTT TAC h?Z CTG CTA CCA GAA AAG GAC CTC ACT GGC TTC CCT GGG 1924 
Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly
620 625 630 

CCT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT CGT CAT AAT GTC 1972 
Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn Arg His Asn Val 
635 640 645 

TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT ACA AAT GAA AAG 2020 
Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu Lys 
650 655 660 

GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG GCA TTC ACC AAC 2068 
Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr Asn 
665 670 675 680 

TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTT CAA CAG TAT GAA 2116 
Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Gin Tyr Glu 
685 690 695 
ATG CAT GGA CCT GAA GGT CTA CGT GTA GGT TTT TAT GAG TCA GAT GTA
Met His Gly Pro Glu Gly Leu Arg Val Gly Phe Tyr Glu Ser Asp Val
700 705 710

ATG GGA AGA GGC CAT GCA CGC CTG GTG CAT GTT GAA GAG CCT CAC ACG 
Met Gly Arg Gly His Ala Arg Leu Val His Val Glu Glu Pro His Thr
715 720 

GAG ACC GTA CGA AAG TAC TTC CCT GAG ACA TGG ATC TGG GAT TTG GTG 
Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp Ile Trp Asp Leu Val
730 735 

nr pta AAr TCA GCA GGT GTG GCT GAG GTA GGA GTA ACA GTC CCT GAC 
Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val Pro Asp
745 750 755 760

ACC ATC ACC GAG TGG AAG GCA GGG GCC TTC TGC CTG TCT GAA GAT GCT 
Thr Ile Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu Asp Ala
765 770 775

GGA CTT GGT ATC TCT TCC ACT GCC TCT CTC CGA GCC TTC CAG CCC TTC 
Gly Leu Gly lie Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin Pro Phe 

TTT GTG GAG CTT ACA ATG CCT TAC TCT GTG ATT CGT GGA GAG GCC TTC 
Phe Val Glu Leu Thr Met Pro Tyr Ser Val Ile Arg Gly Glu Ala Phe
795 800 805

ACA CTC AAG GCC ACG GTC CTA AAC TAC CTT CCC AAA TGC ATC CGG GTC 
Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys He Arg Val 
810 815 820 

AGT GTG CAG CTG GAA GCC TCT CCC GCC TTC CTA GCT GTC CCA GTG GAG 
Ser Val Gln Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro Val Glu
825 830 835 840

AAG GAA CAA GCG CCT CAC TGC ATC TGT GCA AAC GGG CGG CAA ACT GTG 
Lys Glu Gln Ala Pro His Cys Ile Cys Ala Asn Gly Arg Gln Thr Val
845 850 855

TCC TGG GCA GTA ACC CCA AAG TCA TTA GGA AAT GTG AAT TTC ACT GTG 
Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe Thr Val 
860 865 870 

AGC GCA GAG GCA CTA GAG TCT CAA GAG CTG TGT GGG ACT GAG GTG CCT 
Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly Thr Glu Val Pro 
875 880 885 

TCA GTT CCT GAA CAC GGA AGG AAA GAC ACA GTC ATC AAG CCT CTG TTG 
Ser Val Pro Glu His Gly Arg Lys Asp Thr Val Ile Lys Pro Leu Leu
890 895 900 

GTT GAA CCT GAA GGA CTA GAG AAG GAA ACA ACA TTC AAC TCC CTA CTT 
Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser Leu Leu
905 910 915 920 
TGT CCA TCA GGT GGT GAG GTT TCT GAA GAA TTA TCC CTG AAA CTG CCA 2836
Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys Leu Pro 
925 930 935 

CCA AAT GTG GTA GAA GAA TCT GCC CGA GCT TCT GTC TCA GTT TTG GGA 2884 
Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val Leu Gly 
940 945 950 

GAC ATA TTA GGC TCT GCC ATG CAA AAC ACA CAA AAT CTT CTC CAG ATG 2932 
Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu Gin Met 
955 960 965 

CCC TAT GGC TGT GGA GAG CAG AAT ATG GTC CTC TTT GCT CCT AAC ATC 2980 
Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro Asn He 
970 975 980 

TAT GTA CTG GAT TAT CTA AAT GAA ACA CAG CAG CTT ACT CCA GAG ATC 3028 
Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro Glu He 
985 990 995 1000 

AAG TCC AAG GCC ATT GGC TAT CTC AAC ACT GGT TAC CAG AGA CAG TTG 3076 
Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg Gin Leu 
1005 1010 1015 

AAC TAC AAA CAC TAT GAT GGC TCC TAC AGC ACC TTT GGG GAG CGA TAT 3124 
Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu Arg Tyr 
1020 1025 1030 

GGC AGG AAC CAG GGC AAC ACC TGG CTC ACA GCC TTT GTT CTG AAG ACT 3172 
Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu Lys Thr 
1035 1040 1045 

TTT GCC CAA GCT CGA GCC TAC ATC TTC ATC GAT GAA GCA CAC ATT ACC 3220 
Phe Ala Gin Ala Arg Ala Tyr He Phe lie Asp Glu Ala His He Thr 
1050 1055 1060 

CAA GCC CTC ATA TGG CTC TCC CAG AGG CAG AAG GAC AAT GGC TGT TTC 3268 
Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly Cys Phe 
1065 1070 1075 1080 

AGG AGC TCT GGG TCA CTG CTC AAC AAT GCC ATA AAG GGA GGA GTA GAA 3316 
Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val Glu 
1085 1090 1095 

GAT GAA GTG ACC CTC TCC GCC TAT ATC ACC ATC GCC CTT CTG GAG ATT 3364 
Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu Glu He 
1100 1105 1110 

CCT CTC ACA GTC ACT CAC CCT GTT GTC CGC AAT GCC CTG TTT TGC CTG 3412 
Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe Cys Leu 
1115 1120 " 1125 

GAG TCA GCC TGG AAG ACA GCA CAA GAA GGG GAC CAT GGC AGC CAT GTA 3460 
Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His Val 
1130 1135 1140 
TAT ACC AAA GCA CTG CTG GCC TAT GCT TTT GCC CTG GCA GGT AAC CAG
Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly Asn Gln
1145 1150 1155 1160

GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG GAA GCT GTG AAG 
Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala Val Lys
1165 1170 

AAA GAC AAC TCT GTC CAT TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA 
Lys Asp Asn Ser Val His Trp Glu Arg Pro Gln Lys Pro Lys Ala Pro
1180 1185 1190

GTG GGG CAT TTT TAC GAA CCC CAG GCT CCC TCT GCT GAG GTG GAG ATG 
Val Gly His Phe Tyr Glu Pro Gln Ala Pro Ser Ala Glu Val Glu Met
1195 1200 1205

ACA TCC TAT GTG CTC CTC GCT TAT CTC ACG GCC CAG CCA GCC CCA ACC 
Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro Thr 
1210 1215 1220 

TCG GAG GAC CTG ACC TCT GCA ACC AAC ATC GTG AAG TGG ATC ACG AAG 
Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp He Thr Lys 
1225 1230 1235 1240 

CAG CAG AAT GCC CAG GGC GGT TTC TCC TCC ACC CAG CAC ACA GTG GTG 
Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin His Thr Val Val 
1245 1250 1255 

GCT CTC CAT GCT CTG TCC AAA TAT GGA GCA GCC ACA TTT ACC AGG ACT 
Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg Thr 
1260 1265 1270

GGG AAG GCT GCA CAG GTG ACT ATC CAG TCT TCA GGG ACA TTT TCC AGC 
Gly Lys Ala Ala Gln Val Thr Ile Gln Ser Ser Gly Thr Phe Ser Ser
1275 1280 1285 

AAA TTC CAA GTG GAC AAC AAC AAC CGC CTG TTA CTG CAG CAG GTC TCA 
Lys Phe Gln Val Asp Asn Asn Asn Arg Leu Leu Leu Gln Gln Val Ser
1290 1295 1300 

TTG CCA GAG CTG CCT GGG GAA TAC AGC ATG AAA GTG ACA GGA GAA GGA 
Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu Gly 
1305 1310 1315 1320 

TGT GTC TAC CTC CAG ACA TCC TTG AAA TAC AAT ATT CTC CCA GAA AAG 
Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro Glu Lys 
1325 1330 1335 



GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG CCT CAA ACT TGT 
Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu Pro Gin Thr Cys 
1340 1345 1350 

GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC CTA AGT GTC AGT 
Asp Glu Pro Lys Ala His Thr Ser Phe Gln Ile Ser Leu Ser Val Ser
1355 1360 1365
TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC GTT GAT GTG AAG 4180
Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala Ile Val Asp Val Lys
1370 1375 1380

ATG GTC TCT GGC TTC ATT CCC CTG AAG CCA ACA GTG AAA ATG CTT GAA 4228 
Met Val Ser Gly Phe Ile Pro Leu Lys Pro Thr Val Lys Met Leu Glu
1385 1390 1395 1400 

AGA TCT AAC CAT GTG AGC CGG ACA GAA GTC AGC AGC AAC CAT GTC TTG 4276 
Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn His Val Leu 
1405 * 1410 1415 

ATT TAC CTT GAT AAG GTG TCA AAT CAG ACA CTG AGC TTG TTC TTC ACG 4324 
He Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser Leu Phe Phe Thr 
1420 1425 1430 

GTT CTG CAA GAT GTC CCA GTA AGA GAT CTC AAA CCA GCC ATA GTG AAA 4372 
Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro Ala He Val Lys 
1435 1440 1445 

GTC TAT GAT TAC TAC GAG ACG GAT GAG TTT GCA ATT GCT GAG TAC AAT 4420 
Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala He Ala Glu Tyr Asn 
1450 1455 1460 

GCT CCT TGC AGC AAA GAT CTT GGA AAT GCT TGAAGACCAC AAGGCTGAAA 4470 
Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 

AGTGCTTTGC TGGAGTCCTG TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT 4530 
AAAGACTTGA TGAATAAACA CTTTTTCTGG TCAAAAAAA 4569 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1474 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(E) FEATURES: bait region: 690-730 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu
1 5 10 15

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 
20 25 30 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 70 75 80 
Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser
85 90 95

Asn Glu Glu Val Met Phe Leu Thr Val Gln Val Lys Gly Pro Thr Gln
100 105 110

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 " 120 125 

Val Phe Val Gin Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 160 

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg lie Ala 
165 170 175 

Gln Trp Gln Ser Phe Gln Leu Glu Gly Gly Leu Lys Gln Phe Ser Phe
180 185 190

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 
245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser Ile Cys Arg Lys Tyr
260 265 270

Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu 
275 280 285 

Lys Phe Ser Gly Gln Leu Asn Ser His Gly Cys Phe Tyr Gln Gln Val
290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 
325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 
340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
370 375 380 
Phe Ile Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp
385 390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 
405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 
420 425 430 

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
465 470 475 480 

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 
485 490 495 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 
500 505 ~ 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 
515 520 525 

Pro Val Lys Ser Asp He Ala Pro Val Ala Arg Leu Leu He Tyr Ala 
530 535 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 
565 570 575 

Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin 
580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys He Asn Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr 
645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 
660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 
675 680 685 
Cys Pro Gln Leu Gln Gln Tyr Glu Met His Gly Pro Glu Gly Leu Arg
690 695 700

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu
705 710 715 720

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro
725 730 735

Glu Thr Trp Ile Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala
740 745 750

Glu Val Gly Val Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly 
755 760 765 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly lie Ser Ser Thr Ala 
770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 
785 790 795 800 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 
805 810 815 

Tyr Leu Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro 
820 825 830 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 
835 840 845 

Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 
850 855 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 870 875 880 

Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 
885 890 895 

Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys 
900 905 910 

Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser 
915 920 925 

Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala 
930 935 940 

Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala Met Gin 
945 950 955 960

Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn 
965 970 975 

Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu 
980 985 990 
Thr Gln Gln Leu Thr Pro Glu Ile Lys Ser Lys Ala Ile Gly Tyr Leu
995 1000 1005 

Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser 
1010 1015 1020 

Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He 
1045 1050 1055

Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 
1060 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 
1075 1080 1085 

Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 
1090 1095 1100 

He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 
1105 1110 1115 1120 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 
1125 1130 1135 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 
1140 1145 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 
1155 1160 1165 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 
1170 1175 1180 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 
1185 1190 1195 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 
1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 
1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 
1235 1240 1245 

Ser Ser Thr Gin His Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 
1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 
1285 1290 1295 
Arg Leu Leu Leu Gln Gln Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr
1300 1305 1310 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 
1315 1320 1325 

Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 
1330 1335 1340

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 1360 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 
1365 1370 1375

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe lie Pro Leu 
1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 
1395 1400 1405

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 
1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala Ile Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp
1445 1450 1455

Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 

Asn Ala 

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: Y 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 29..4480
(D) OTHER INFORMATION: 
(ix) FEATURE:

(A) NAME/KEY: insertion_seq

(B) LOCATION: 2102..2305
(D) OTHER INFORMATION: 
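
The feature coordinates for SEQ ID NO:3 can be cross-checked arithmetically. The following is a minimal Python sketch, not part of the original listing. It assumes inclusive, 1-based nucleotide coordinates, and the helper names `feature_length` and `codon_index` are hypothetical, chosen only for illustration.

```python
# Illustrative helpers only; these names do not appear in the sequence listing.

def feature_length(start: int, end: int) -> int:
    """Length in nucleotides of a feature with inclusive, 1-based coordinates."""
    return end - start + 1


def codon_index(cds_start: int, position: int) -> int:
    """1-based index of the codon that contains a given nucleotide position."""
    return (position - cds_start) // 3 + 1


# Coordinates copied from the (ix) FEATURE entries of SEQ ID NO:3.
CDS_START, CDS_END = 29, 4480            # NAME/KEY: CDS
INSERT_START, INSERT_END = 2102, 2305    # NAME/KEY: insertion sequence

cds_nt = feature_length(CDS_START, CDS_END)
cds_codons, remainder = divmod(cds_nt, 3)
insert_first = codon_index(CDS_START, INSERT_START)
insert_last = codon_index(CDS_START, INSERT_END)

print(cds_nt, cds_codons, remainder)   # nucleotides, codons, leftover bases
print(insert_first, insert_last)       # first and last codon of the insert
```

Under these assumptions the coding region is 4452 nucleotides, that is 1484 complete codons with no remainder, which agrees with the length of 1484 amino acids stated for SEQ ID NO:4. The inserted segment corresponds to codons 692 to 759.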



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GTCTCCTCCA GCTCCTTCTT TCTGCAAC ATG GGG AAG AAC AAA CTC CTT CAT

Met Gly Lys Asn Lys Leu Leu His 
1 5 

CCA AGT CTG GTT CTT CTC CTC TTG GTC CTC CTG CCC ACA GAC GCC TCA
Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro Thr Asp Ala Ser 
10 15 20 

GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC TCC CTG CTC CAC
Val Ser Gly Lys Pro Gln Tyr Met Val Leu Val Pro Ser Leu Leu His
25 30 35 40 

ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC TAC CTG AAT GAG
Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn Glu 
45 50 55 

ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG GGA AAC AGG AGC
Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg Ser 
60 65 70 

CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC CAC TGT GTC GCC
Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val Ala 
75 80 85 

TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA ATG TTC CTC ACT
Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu Thr 
90 95 100 

GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG CGG ACC ACA GTG
Val Gln Val Lys Gly Pro Thr Gln Glu Phe Lys Lys Arg Thr Thr Val
105 110 115 120

ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG ACA GAC AAA TCA
Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gln Thr Asp Lys Ser
125 130 135 

ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT GTC TCC ATG GAT
Ile Tyr Lys Pro Gly Gln Thr Val Lys Phe Arg Val Val Ser Met Asp
140 145 150 

GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA GTA TAC ATT CAG
Glu Asn Phe His Pro Leu Asn Glu Leu Ile Pro Leu Val Tyr Ile Gln
155 160 165 

GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT TTC CAG TTA GAG
Asp Pro Lys Gly Asn Arg Ile Ala Gln Trp Gln Ser Phe Gln Leu Glu
170 175 180 
GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA GAG CCC TTC CAG
Gly Gly Leu Lys Gln Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe Gln
185 190 195 200

GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT GGA AGG ACA GAG 
Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr Glu 
205 210 215 

CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG TTT GAA GTA CAA 
His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val Gin 
220 225 230 

GTA ACA GTG CCA AAG ATA ATC ACC ATC TTG GAA GAA GAG ATG AAT GTA 
Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn Val 
235 240 245 

TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC CCT GGA CAT GTG 
Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His Val 
250 255 260 

ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC GAC TGC CAC GGT 
Thr Val Ser lie Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His Gly 
265 270 275 280 

GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA CAG CTA AAC AGC 
Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn Ser 
285 290 295 

CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC TTC CAG CTG AAG 
His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu Lys 
300 305 310 

AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC CAG ATC CAA GAA 
Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gln Ile Gln Glu
315 320 325 

GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC AGT GAA ATC ACA 
Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He Thr 
330 335 340 

AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC TCA CAC TTT CGA 
Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe Arg 
345 350 355 360 

CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA GAT GGG AAA GGC 
Gin Gly lie Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys Gly 
365 370 375 

GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA AAT GAA GCA AAC 
Val Pro He Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala Asn 
380 385 390 

TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT GTA CAG TTC TCT 
Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gln Phe Ser
395 400 405 
ATC AAC ACC ACC AAT GTT ATG GGT ACC TCT CTT ACT GTT AGG GTC AAT 1300
Ile Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val Asn
410 415 420 

TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG GTG TCA GAA GAA 1348 
Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu Glu 
425 430 435 440 

CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC TCC CCA AGC AAG 1396 
His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser Lys 
445 450 455 

AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA CCC TGT GGC CAT 1444 
Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly His 
460 465 470 

ACT CAG ACA GTC CAG GCA CAT TAT ATT CTG AAT GGA GGC ACC CTG CTG 1492 
Thr Gin Thr Val Gin Ala His Tyr lie Leu Asn Gly Gly Thr Leu Leu 
475 480 485 

GGG CTG AAG AAG CTC TCC TTC TAT TAT CTG ATA ATG GCA AAG GGA GGC 1540 
Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly Gly 
490 495 500 

ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG CAG GAA GAC ATG 1588 
Ile Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gln Glu Asp Met
505 510 515 520 

AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA GAC ATT GCT CCT 1636 
Lys Gly His Phe Ser lie Ser He Pro Val Lys Ser Asp He Ala Pro 
525 530 535 

GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC GGG GAC GTG ATT 1684 
Val Ala Arg Leu Leu He Tyr Ala Val Leu Pro Thr Gly Asp Val He 
540 545 550 

GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG GCC AAC AAG GTG 1732 
Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys Val 
555 560 565 

GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC TCA CAC GCC CAC 1780 
Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala His 
570 575 580 

CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC CTC CGT GCT GTG 1828 
Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala Val 
585 590 595 600 

GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG CTC TCG GCG TCC 1876 
Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala Ser 
605 610 615 

TCG GTT TAC AAC CTG CTA CCA GAA AAG GAC CTC ACT GGC TTC CCT GGG 1924 
Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro Gly 
620 625 630 
CCT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT CGT CAT AAT GTC
Pro Leu Asn Asp Gln Asp Asp Glu Asp Cys Ile Asn Arg His Asn Val
635 640 645 

TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT ACA AAT GAA AAG 
Tyr Ile Asn Gly Ile Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu Lys
650 655 660

GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG GCA TTC ACC AAC 
Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr Asn
665 670 675 

TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTG CAG TCA GTG TCA 
Ser Lys Ile Arg Lys Pro Lys Met Cys Pro Gln Leu Gln Ser Val Ser
685 690 695

GCC GGC GCC GTG GGA CAG GGA TAT TAT GGA GCC GGA CTG GGA GTG GTG 
Ala Gly Ala Val Gly Gln Gly Tyr Tyr Gly Ala Gly Leu Gly Val Val
700 705 710 

GAG AGG CCT TAT GTG CCT CAG CTG GGT ACC TAT AAT GTG ATC CCT CTG 
Glu Arg Pro Tyr Val Pro Gln Leu Gly Thr Tyr Asn Val Ile Pro Leu
715 720 725

AAT AAT GAG CAG AGC TCA GGA CCT GTG CCT GAG ACA GTG AGG AAG TAT 
Asn Asn Glu Gln Ser Ser Gly Pro Val Pro Glu Thr Val Arg Lys Tyr
730 735 740 

TTC CCT GAG ACA TGG ATC TGG GAT CTG GTG GTG GTG AAT TCC GCG GGT 
Phe Pro Glu Thr Trp Ile Trp Asp Leu Val Val Val Asn Ser Ala Gly
745 750 755 760

GTG GCT GAG GTA GGA GTA ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG 
Val Ala Glu Val Gly Val Thr Val Pro Asp Thr Ile Thr Glu Trp Lys
765 770 775

GCA GGG GCC TTC TGC CTG TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC 
Ala Gly Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly He Ser Ser 
780 785 790 

ACT GCC TCT CTC CGA GCC TTC CAG CCC TTC TTT GTG GAG CTC ACA ATG 
Thr Ala Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met 
795 800 805

CCT TAC TCT GTG ATT CGT GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC 
Pro Tyr Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val 
810 815 820 

CTA AAC TAC CTT CCC AAA TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC 
Leu Asn Tyr Leu Pro Lys Cys Ile Arg Val Ser Val Gln Leu Glu Ala
825 830 835 840

tpt rrr GCf TTC CTA GCT GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC 
Ser Pro Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gln Ala Pro His
845 850 855 
TGC ATC TGT GCA AAC GGG CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA 2644
Cys Ile Cys Ala Asn Gly Arg Gln Thr Val Ser Trp Ala Val Thr Pro
860 865 870 

AAG TCA TTA GGA AAT GTG AAT TTC ACT GTG AGC GCA GAG GCA CTA GAG 2692 

Lys Ser Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu 
875 880 885 

TCT CAA GAG CTG TGT GGG ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA 2740 
Ser Gin Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly 

890 895 900 

AGG AAA GAC ACA GTC ATC AAG CCT CTG TTG GTT GAA CCT GAA GGA CTA 2788 

Arg Lys Asp Thr Val lie Lys Pro Leu Leu Val Glu Pro Glu Gly Leu 
905 910 915 920 

GAG AAG GAA ACA ACA TTC AAC TCC CTA CTT TGT CCA TCA GGT GGT GAG 2836 

Glu Lys Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu 
925 930 935 

GTT TCT GAA GAA TTA TCC CTG AAA CTG CCA CCA AAT GTG GTA GAA GAA 2884 

Val Ser Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu 
940 945 950 

TCT GCC CGA GCT TCT GTC TCA GTT TTG GGA GAC ATA TTA GGC TCT GCC 2932 

Ser Ala Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala 
955 960 965 

ATG CAA AAC ACA CAA AAT CTT CTC CAG ATG CCC TAT GGC TGT GGA GAG 2980 

Met Gin Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu 

970 975 980 

CAG AAT ATG GTC CTC TTT GCT CCT AAC ATC TAT GTA CTG GAT TAT CTA 3028 

Gin Asn Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu 
985 990 995 1000 

AAT GAA ACA CAG CAG CTT ACT CCA GAG ATC AAG TCC AAG GCC ATT GGC 3076 

Asn Glu Thr Gin Gin Leu Thr Pro Glu lie Lys Ser Lys Ala He Gly 
1005 1010 1015 

TAT CTC AAC ACT GGT TAC CAG AGA CAG TTG AAC TAC AAA CAC TAT GAT 3124 

Tyr Leu Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp 
1020 1025 1030 

GGC TCC TAC AGC ACC TTT GGG GAG CGA TAT GGC AGG AAC CAG GGC AAC 3172 

Gly Ser Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn 
1035 1040 1045 

ACC TGG CTC ACA GCC TTT GTT CTG AAG ACT TTT GCC CAA GCT CGA GCC 3220 

Thr Trp Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala 

1050 1055 1060 

TAC ATC TTC ATC GAT GAA GCA CAC ATT ACC CAA GCC CTC ATA TGG CTC 3268 

Tyr He Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu 
1065 1070 1075 1080 
TCC CAG AGG CAG AAG GAC AAT GGC TGT TTC AGG AGC TCT GGG TCA CTG
Ser Gln Arg Gln Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu
1085 1090 1095

CTC AAC AAT GCC ATA AAG GGA GGA GTA GAA GAT GAA GTG ACC CTC TCC 
Leu Asn Asn Ala Ile Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser
1100 1105 1110

GCC TAT ATC ACC ATC GCC CTT CTG GAG ATT CCT CTC ACA GTC ACT CAC 
Ala Tyr He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His 
1115 1120 1125 

CCT GTT GTC CGC AAT GCC CTG TTT TGC CTG GAG TCA GCC TGG AAG ACA 
Pro Val Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr 
1130 1135 1140

GCA CAA GAA GGG GAC CAT GGC AGC CAT GTA TAT ACC AAA GCA CTG CTG 
Ala Gln Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu
1145 1150 1155 1160

GCC TAT GCT TTT GCC CTG GCA GGT AAC CAG GAC AAG AGG AAG GAA GTA 
Ala Tyr Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val 
1165 1170 1175

CTC AAG TCA CTT AAT GAG GAA GCT GTG AAG AAA GAC AAC TCT GTC CAT 
Leu Lys Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His 
1180 1185 1190

TGG GAG CGC CCT CAG AAA CCC AAG GCA CCA GTG GGG CAT TTT TAC GAA 
Trp Glu Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu 
1195 1200 1205

CCC CAG GCT CCC TCT GCT GAG GTG GAG ATG ACA TCC TAT GTG CTC CTC 
Pro Gin Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu 
1210 1215 1220 

GCT TAT CTC ACG GCC CAG CCA GCC CCA ACC TCG GAG GAC CTG ACC TCT 
Ala Tyr Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser 
1225 1230 1235 1240 

GCA ACC AAC ATC GTG AAG TGG ATC ACG AAG CAG CAG AAT GCC CAG GGC 
Ala Thr Asn Ile Val Lys Trp Ile Thr Lys Gln Gln Asn Ala Gln Gly
1245 1250 1255 

GGT TTC TCC TCC ACC CAG CAC ACA GTG GTG GCT CTC CAT GCT CTG TCC 
Gly Phe Ser Ser Thr Gin His Thr Val Val Ala Leu His Ala Leu Ser 
1260 1265 1270 

AAA TAT GGA GCA GCC ACA TTT ACC AGG ACT GGG AAG GCT GCA CAG GTG 
Lys Tyr Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val 
1275 1280 1285 

ACT ATC CAG TCT TCA GGG ACA TTT TCC AGC AAA TTC CAA GTG GAC AAC 
Thr lie Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn 
1290 1295 1300 
AAC AAC CGC CTG TTA CTG CAG CAG GTC TCA TTG CCA GAG CTG CCT GGG 3988
Asn Asn Arg Leu Leu Leu Gln Gln Val Ser Leu Pro Glu Leu Pro Gly
1305 1310 1315 1320 

GAA TAC AGC ATG AAA GTG ACA GGA GAA GGA TGT GTC TAC CTC CAG ACA 4036 
Glu Tyr Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr 
1325 1330 1335 

TCC TTG AAA TAC AAT ATT CTC CCA GAA AAG GAA GAG TTC CCC TTT GCT 4084 
Ser Leu Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala 
1340 1345 1350 

TTA GGA GTG CAG ACT CTG CCT CAA ACT TGT GAT GAA CCC AAA GCC CAC 4132 
Leu Gly Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His 
1355 1360 1365 

ACC AGC TTC CAA ATC TCC CTA AGT GTC AGT TAC ACA GGG AGC CGC TCT 4180 
Thr Ser Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser 
1370 1375 1380 

GCC TCC AAC ATG GCG ATC GTT GAT GTG AAG ATG GTC TCT GGC TTC ATT 4228 
Ala Ser Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He 
1385 1390 1395 1400 

CCC CTG AAG CCA ACA GTG AAA ATG CTT GAA AGA TCT AAC CAT GTG AGC 4276 
Pro Leu Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser 
1405 1410 1415 

CGG ACA GAA GTC AGC AGC AAC CAT GTC TTG ATT TAC CTT GAT AAG GTG 4324 
Arg Thr Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val 
1420 1425 1430 

TCA AAT CAG ACA CTG AGC TTG TTC TTC ACG GTT CTG CAA GAT GTC CCA 4372 
Ser Asn Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro 
1435 1440 1445 

GTA AGA GAT CTG AAA CCA GCC ATA GTG AAA GTC TAT GAT TAC TAC GAG 4420 
Val Arg Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu 
1450 1455 1460

ACG GAT GAG TTT GCA ATT GCT GAG TAC AAT GCT CCT TGC AGC AAA GAT 4468 
Thr Asp Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp 
1465 1470 1475 1480 

CTT GGA AAT GCT TGAAGACCAC AAGGCTGAAA AGTGCTTTGC TGGAGTCCTG 4520 
Leu Gly Asn Ala 



TTCTCTGAGC TCCACAGAAG ACACGTGTTT TTGTATCTTT AAAGACTTGA TGAATAAACA 4580 
CTTTTTCTGG TCAAAAAAA 4599 
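
The amino acid line printed beneath each nucleotide line of SEQ ID NO:3 can be checked by translating the codons with the standard genetic code. The following is a minimal Python sketch, not part of the original listing. The names `CODON_TABLE` and `translate` are hypothetical, the label `Ter` for a stop codon is an assumption of this sketch, and the two test strings are the first eight codons and the last four codons plus the following TGA triplet as printed above.

```python
from typing import List

# Standard genetic code, written in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# One-letter to three-letter amino acid codes; "Ter" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}

# Build the 64-entry codon table from the two strings above.
CODON_TABLE = {
    a + b + c: AMINO_1[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str) -> List[str]:
    """Translate a DNA string (spaces allowed) into three-letter residues."""
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore an incomplete trailing codon
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, usable, 3)]


first_codons = translate("ATG GGG AAG AAC AAA CTC CTT CAT")
last_codons = translate("CTT GGA AAT GCT TGA")
print(" ".join(first_codons))
print(" ".join(last_codons))
```

The first call reproduces the residues printed for positions 1 to 8 (Met Gly Lys Asn Lys Leu Leu His), and the second shows that the coding region ends with Leu Gly Asn Ala followed by a TGA stop codon.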



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear
(E) FEATURES: bait region: 690-740 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Gly Lys Asn Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu 
1 5 10 

Val Leu Leu Pro Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met 
20 25 30 

Val Leu Val Pro Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys 
35 40 45 

Val Leu Leu Ser Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu 
50 55 60 

Glu Ser Val Arg Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu 
65 70 75 80 

Asn Asp Val Leu His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser 
85 90 95 

Asn Glu Glu Val Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin 
100 105 110 

Glu Phe Lys Lys Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu 
115 120 125 

Val Phe Val Gin Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val 
130 135 140 

Lys Phe Arg Val Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu 
145 150 155 160

Leu He Pro Leu Val Tyr He Gin Asp Pro Lys Gly Asn Arg lie Ala 
165 170 175 

Gin Trp Gin Ser Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe 
180 185 190 

Pro Leu Ser Ser Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin 
195 200 205 

Lys Lys Ser Gly Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe 
210 215 220 

Val Leu Pro Lys Phe Glu Val Gin Val Thr Val Pro Lys He He Thr 
225 230 235 240 

He Leu Glu Glu Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr 
245 250 255 

Gly Lys Pro Val Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr 
260 265 270 
Ser Asp Ala Ser Asp Cys His Gly Glu Asp Ser Gln Ala Phe Cys Glu
275 280 285 

Lys Phe Ser Gly Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val 
290 295 300 

Lys Thr Lys Val Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu 
305 310 315 320 

His Thr Glu Ala Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr 
325 330 335 

Gly Arg Gin Ser Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe 
340 345 350 

Val Lys Val Asp Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin 
355 360 365 

Val Arg Leu Val Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He 
370 375 380 

Phe He Arg Gly Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp 
385 390 395 400 

Glu His Gly Leu Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly 
405 410 415 

Thr Ser Leu Thr Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr 
420 425 430

Gly Tyr Gin Trp Val Ser Glu Glu His Glu Glu Ala His His Thr Ala 
435 440 445 

Tyr Leu Val Phe Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met 
450 455 460 

Ser His Glu Leu Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr 
465 470 475 480

He Leu Asn Gly Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr 
485 490 495 

Tyr Leu He Met Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly 
500 505 510 

Leu Leu Val Lys Gin Glu Asp Met Lys Gly His Phe Ser He Ser He 
515 520 525 

Pro Val Lys Ser Asp lie Ala Pro Val Ala Arg Leu Leu He Tyr Ala 
530 535 540 

Val Leu Pro Thr Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val 
545 550 555 560 

Glu Asn Cys Leu Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin 
565 570 575 
Ser Leu Pro Ala Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gln
580 585 590 

Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 
595 600 605

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 
610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys He Asn Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr 
645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 
660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met 
675 680 685 

Cys Pro Gin Leu Gin Ser Val Ser Ala Gly Ala Val Gly Gin Gly Tyr 
690 695 700 

Tyr Gly Ala Gly Leu Gly Val Val Glu Arg Pro Tyr Val Pro Gin Leu 
705 710 715 720 

Gly Thr Tyr Asn Val He Pro Leu Asn Asn Glu Gin Ser Ser Gly Pro 
725 730 735 

Val Pro Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He Trp Asp 
740 745 750

Leu Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val 
755 760 765 

Pro Asp Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu 
770 775 780 

Asp Ala Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin 
785 790 795 800 

Pro Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg Gly Glu 
805 810 815 

Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys He 
820 825 830 

Arg Val Ser Val Gln Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro
835 840 845 

Val Glu Lys Glu Gin Ala Pro His Cys lie Cys Ala Asn Gly Arg Gin 
850 855 860 

Thr Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe 
865 870 875 880 
Thr Val Ser Ala Glu Ala Leu Glu Ser Gln Glu Leu Cys Gly Thr Glu
885 890 895 

Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val He Lys Pro 
900 905 910 

Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser 
915 920 925 

Leu Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys 
930 935 940 

Leu Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val 
945 950 955 960 

Leu Gly Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu 
965 970 975 

Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro 
980 985 990 

Asn lie Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro 
995 1000 1005 

Glu He Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg 
1010 1015 1020 

Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu 
1025 1030 1035 1040 

Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu 
1045 1050 1055 

Lys Thr Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu Ala His 
1060 1065 1070 

lie Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly 
1075 1080 1085

Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly 
1090 1095 1100 

Val Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu 
1105 1110 1115 1120 

Glu He Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe 
1125 1130 1135 

Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser 
1140 1145 1150 

His Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly 
1155 1160 1165 

Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala 
1170 1175 1180 
Val Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gln Lys Pro Lys
1185 1190 1195 1200

Ala Pro Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val 
1205 1210 

Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala 
1220 1225 1230 

Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp lie 
1235 1240 1245 

Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin His Thr 
1250 1255 1260 

Val Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr 
1265 1270 1275 

Arg Thr Gly Lys Ala Ala Gln Val Thr Ile Gln Ser Ser Gly Thr Phe
1285 1290 1295

Ser Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin 
1300 1305 1310 

Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly 
1315 1320 1325

Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro 
1330 1335 1340 

Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu Pro Gin 
1345 1350 1355 1360 

Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser Leu Ser 
1365 1370 1375 

Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He Val Asp 
1380 1385 1390 

Val Lys Met Val Ser Gly Phe He Pro Leu Lys Pro Thr Val Lys Met 
1395 1400 1405

Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn His 
1410 1415 1420 

Val Leu He Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser Leu Phe 
1425 1430 1435 1440

Phe Thr Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro Ala He 
1445 1450 1455 

Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala He Ala Glu 
1460 1465 1470

Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1475 1480 
PATENT CLAIMS

1. A process for the production of recombinant α-macroglobulin,
variants, fragments or derivatives thereof, wherein a functionally operative
expression vector comprising a gene encoding for the expression of
α-macroglobulin, variants, fragments or derivatives thereof, or alleles of
such a gene, is introduced into a suitable host capable of expressing said
gene, said host is cultured in a suitable nutrient medium containing sources
of assimilable carbon and nitrogen and other essential nutrients, and the
expressed α-macroglobulin or fragments or derivatives thereof is recovered.

2. The process of claim 1, wherein said gene encodes for the
expression of human α₂-macroglobulin, variants, fragments or derivatives
thereof.

3. The process of claim 2, wherein said gene encodes for the
expression of human α₂-macroglobulin having the amino acid sequence of SEQ
ID NO:2, or a fragment or derivative thereof.

4. The process of claim 2 or 3, wherein said gene comprises the DNA
sequence of SEQ ID NO:1, or a fragment thereof.

5. The process of claim 1 or 2, wherein said gene encodes for a
variant α-macroglobulin, in which the amino acid sequence of the bait region
has been altered.

6. The process of claim 5, wherein the bait region has been altered
by incorporation of further proteinase target sites.

7. The process of claim 5, wherein the bait region has been altered
by removal of proteinase target sites.

8. The process of claim 5, wherein the bait region has been altered
by replacing one or more specific proteinase target sites with one or more
other specific proteinase target sites.

9. The process of claim 8, wherein said proteinase target sites are
specific for bovine trypsin, Streptomyces griseus trypsin, papain, porcine
elastase, bovine chymosin, bovine chymotrypsin, Staphylococcus aureus strain
V8 proteinase, human plasmin, bovine thrombin, thermolysin, subtilisin Novo
and/or Streptomyces griseus proteinase B.

10. The process of claim 5, wherein the bait region has been
altered by replacing said bait region or part thereof with a bait region or
a part thereof from another α-macroglobulin.

11. The process of claim 10, wherein said bait regions originate from
human α₂M, Pregnancy Zone Protein (PZP), rat α₁M, rat α₂M, rat α₁I₃ variant
1, or rat α₁I₃ variant 2 (α₁I₃ = α₁-inhibitor 3), especially PZP.

12. The process of any of claims 5 to 11, wherein said gene encodes
for the expression of human α₂-macroglobulin variant having the amino acid
sequence of SEQ ID NO:4, or a fragment or derivative thereof.

13. The process of any of claims 5 to 12, wherein said gene comprises
the DNA sequence of SEQ ID NO:3, or a fragment thereof.

14. The process of any of the claims 1 to 13, wherein said gene is
a synthetic gene.

15. The process of any of the claims 1 to 14, wherein said
α-macroglobulin, variant, fragment or derivative thereof is co-expressed with
a desired gene product.

16. The process of any of the claims 1 to 15, wherein said gene is,
or is derived from, a human gene.

17. The process of any of the claims 1 to 16, wherein said host is
a bacterial strain, a fungal strain, a mammalian cell line, or a mammal.

18. The process of claim 17, wherein said host is a fungus.

19. The process of claim 18, wherein said fungus belongs to the genus
Aspergillus.

20. The process of claim 18, wherein said host is a yeast.

21. The process of claim 20, wherein said yeast belongs to the genus
Saccharomyces.

22. The process of claim 17, wherein said host is a mammalian cell
line.

23. The process of claim 22, wherein said mammalian cell line is a
Syrian Baby Hamster Kidney (BHK) cell line.

24. The process of claim 23, wherein said cell line is available from
ATCC under No. CRL 1632.

25. A DNA sequence comprising a gene encoding for the expression of 

an a-macroglobul in, variants, fragments or derivatives thereof. 

15 

26 The DNA sequence of claim 25, wherein said gene encodes for human 

a2"-macroglobul in. 

27. The DNA sequence of claim 25, wherein said gene encodes for the amino 
20 acid sequence of SEQ ID N0:2 or a fragment or derivative thereof. 

28. The DNA sequence of claim 26 or 27, wherein said gene has the 
nucleotide sequence of SEQ ID N0:1 or a fragment thereof. 

25 29. The DNA sequence of claim 25 or 26, wherein said gene encodes 

for a variant a-macroglobul in, in which the amino acid sequence of the bait 
region has been altered. 

30. The DNA sequence of claim 29, wherein said bait region has been 
30 altered by incorporation of further proteinase target sites. 

31. The DNA sequence of claim 29, wherein said bait region has been 
altered by removal of proteinase target sites. 

35 32. The DNA sequence of claim 29, wherein said bait region has been 

altered by replacing one or more specific proteinase target sites with one 
or more other specific proteinase target sites. 
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33. The DNA sequence of claim 29, wherein, wherein said proteinase 
target sites are specific for bovine trypsin, Strppt.omyr.es griseus trypsin, 
papain, porcine elastase, bovine chymosin, bovine chymotrypsin, Staphylococ- 
cus aureus strain V8 proteinase, human plasmin, bovine thrombin, thermoly- 
sin, subtil i sin Novo and/or Streptomvces griseus proteinase B. 

34. The DNA sequence of claim 29, wherein the bait region has been 
altered by replacing said bait region or part thereof with a bait region or 
a part thereof from another α-macroglobulin.

35. The DNA sequence of claim 34, wherein said bait region originates 
from human α2M, Pregnancy Zone Protein (PZP), rat α1M, rat α2M, rat α1I3
variant 1, or rat α1I3 variant 2, especially PZP.

36. A functionally operative expression vector comprising a gene in
accordance with any of the claims 25 to 35 for the expression of human
α2-macroglobulin, variants, fragments or derivatives thereof, or alleles of
such a gene. 

37. The vector of claim 36, further comprising regulatory elements
necessary for the stable maintenance of said vector in mammalian cells.

38. The vector of claim 36 or 37, further comprising sequences 
providing for the processing and secretion of the expressed product. 

39. The vector of any of the claims 36 to 38, further comprising one 
or more other genes encoding for a desired gene product. 

40. A functionally operative expression vector comprising a gene
encoding for the expression of an α-macroglobulin, variants, fragments or
derivatives thereof, or alleles of such a gene, essentially as described.

41. A transformed host comprising a functionally operative expression 
vector comprising a gene encoding for the expression of human α2-macroglobulin
or fragments or derivatives thereof, or alleles of such a gene.

42. The host of claim 41, wherein said vector is the vector of any 
of the claims 36 to 40. 



43. The host of claim 41 or 42, wherein said host is a bacterial
strain, a fungal strain, a mammalian cell line, or a mammal. 

44. The host of claim 43, wherein said host is a fungus. 

45. The host of claim 44, wherein said fungus belongs to the genus 
Aspergillus.

46. The host of claim 44, wherein said host is a yeast. 

47. The host of claim 46, wherein said host belongs to the genus
Saccharomyces.

48. The host of claim 43, wherein said host is a mammalian cell line. 

49. The host of claim 48, wherein said host is a Syrian Baby Hamster 
Kidney (BHK) cell line. 

50. The host of claim 49, wherein said cell line is available from 
ATCC under No. CRL 1632.

51. Recombinant human α2-macroglobulin of SEQ ID NO:2 or SEQ ID NO:4
in an active form. 

52. Recombinant α-macroglobulin, variants, fragments or derivatives
thereof produced by a process of any of the claims 1 to 24.

53. Recombinant α-macroglobulin, variants, fragments or derivatives
thereof of claim 52 produced by the use of a vector of any of the claims 36
to 40.

54. Recombinant α-macroglobulin, variants, fragments or derivatives
thereof essentially as described. 

55. Recombinant human α2-macroglobulin, variants, fragments or
derivatives thereof essentially as described.

56. A growth medium comprising one or more α-macroglobulins.

57. A growth medium comprising recombinant α-macroglobulin, variants,
fragments or derivatives thereof according to any of the claims 51 to 55. 

58. Use of recombinant α-macroglobulin, variants, fragments or
derivatives thereof according to any of the claims 51 to 55 as a protein
carrier in enzyme replacement therapy.

59. Use of recombinant α-macroglobulin, variants, fragments or
derivatives thereof according to any of the claims 51 to 55 as a DNA carrier
in gene therapy.